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wo 01/46256 PCT/USOO/34919 

VESICLE TRAFFICKING PROTEINS 



TECHNICAL FIELD 

This invention relates to nucleic add and amino acid sequences of vesicle trafficking proteins 
5 and to the use of these sequences in the diagnosis, treatment, and prevention of vesicle trafficking 
disorders, autoimmune/inflammatory disorders, and cancer, and in the assessment of the effects of 
exogenous compounds on the expression of nucleic acid and amino acid sequences of vesicle 
trafficking proteins. 

10 BACKGROUND OF TBE INVENTION 

Eukaryotic cells are bound by a lipid bilayer membrane and subdivided into functionally 
distinct, membrane-bound con5)artments. The membranes maintain the essential differences between 
the cytosol, the extracellular environment, an(^ the lumenal space of each intracellular organeUe. As 
lipid membranes are highly impermeable to most polar molecules, transport of essential nutrients, 

15 metabolic waste products, cell signaling molecules, macromolecules, and proteins across lipid 

membranes and between organelles must be mediated by a variety of transport-associated molecules. 

Integral membrane proteins, secreted proteins, and proteins destmed for the lumen of 
organelles are synthesized within the endoplasmic reticulum (ER), delivered to the Golgi complex for 
post-translational processing and sorting, and then transported to specific intracellular and 

20 extracellular destinations. Material is internalized from the extracellular environment by endocytosis, 
a process essential for transmission of neuronal, metabolic, and proliferative signals; uptake of many 
essential nutrients; and defense against invading organisms. This intracellular and extracellular 
movement of protein molecules is termed vesicle trafficking. Trafficking is accon^lished by the 
packaging of protein molecules into specialized vesicles which bud from the donor organeDe 

25 membrane and fiise to the target membrane (Rothman, JJE and F.T. Wieland (1996) Science 
272:227-234). 

The transport of proteins across the ER membrane involves a process that is similar in 
bacteria, yeast, and mammals (Gorlich, D. et al. (1992) CeU 71: 489-503). In mammalian systems, 
transport is initiated by the action of a cytoplasmic signal recognition particle (SRP) which recognizes 

30 a signal sequence on a growing, nascent polypeptide and binds the polypeptide and its ribosome 
con5)lex to the ER membrane through an SRP receptor located on the ER membrane. The signal 
peptide is cleaved and the ribosome complex, together with the attached polypeptide, becomes 
membrane bound. The polypeptide is subsequenfly translocated across the ER membrane and into a 
vesicle (Blobel, G. and B. Dobberstein (1975) J. CeU Biol. 67:852-862). 

35 Proteins implicated in the translocation of polypeptides across the ER membrane in yeast 
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include SEC61p, SEC62p, and SEC63p. Mutations in the genes encoding these proteins lead to 
defects in the translocation process. SEC61 may be of particular in^ortance since certain mutations 
in the gene for this protein inhibit the translocation of many proteins (Gorlich, supr^. 

Mammalian homologs of yeast SEC61 (mSEC61) have been identified in dog and rat 

5 (Gorlich, supra). Mammalian SEC61 is also structurally similar to SECYp, the bacterial cytoplasmic 
membrane translocation protein. mSEC61 is found in tight association with membrane-bound 
ribosomes. This association is induced by membrane-targeting of nascent polypeptide chains and is 
weakened by dissociation of the ribosomes into their constituent subunits. mSEC61 is postulated to 
be a con^nent of a putative protein-conducting channel, located in the ER membrane, to which 

10 nascent polypeptides are transferred following the completion of translation by ribosomes (Gorlich, 
supra) . 

Several steps in the transit of material along the secretory and endocytic pathways requires 
the formation of transport vesicles. Specifically, vesicles form at the transitional endoplasmic 
reticulum (tER), the rim of Golgi cistemae, the face of the Trans-Golgi Network (TGN), the plasma 

15 membrane (PM), and tubular extensions of the endosomes. Vesicle formation occurs when a region 
of membrane buds off fi-om the donor organelle. The membrane-bound vesicle contains proteins to 
be transported and is surrounded by a proteinaceous coat, the coiiq)onents of which are recruited from 
the cytosol. Vesicle formation begins with the budding of a vesicle out of a donor organelle. The 
initial budding and coating processes are controlled by a cytosolic ras-like GTP-binding protein, 

20 ADP-ribosylating factor (Arf), and adapter proteins (AP). Different isoforms of both Arf and AP are 
involved at different sites of budding. For example, Arfs 1, 3, and 5 are required for Golgi budding, 
Arf4 for endosomal budding, and Arf6 for plasma membrane budding. Two different classes of coat 
protein have also been identified. Clathrin coats form on vesicles derived from the TGN and PM, 
whereas coatomer (COP) coats form on vesicles derived from the ER and Golgi (MeUman, L (1996) 

25 Annu. Rev. Cell Dev. Biol. 12:575-625). 

Vesicle formation begms when an adapter protein (AP) interacts with cargo proteins within 
the donor membrane and recruits clathrin to the bud site. APs are heterotetrameric con^lexes 
coin>osed of two large chains (a, g, d, or e, and b), a medium chain (ra), and a small chain (s). 
Qathrin binds to APs via the carboxy-terminal appendage domain of the b-adaptin subunit (Le 

30 Bourgne, R. and B, Hoflack (1998) Curr. Opin. Cell. Biol. 10:499-503). AP-1 functions in protein 
sorting from the TGN and endosomes to compartments of the endosomal/lysosomal system. AP-2 
ftmctions in clathrin-mediated endocytosis at the plasma membrane, while AP-3 is associated with 
endosomes and/or the TGN and recruits integral membrane proteins for transport to lysosomes and 
lysosome-related organelles. The recently isolated AP-4 complex localizes to the TGN or a 

35 neighboring compartment and may play a role in sorting events thought to take place in post-Golgi 
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compartments (Deir AngeUca, E.C. et al. (1999) J. Biol. Chem. 274:7278-7285). Cytosolic GTP- 
bound Arf is also incorporated into the vesicle as it forms. Another GTP-binding protein, dynamin, 
forms a ring conq)lex around the neck of the fonning vesicle and provides the mechanochemical force 
required to release the vesicle from the donor membrane. The coated vesicle conq)lex is then 
5 transported through the cytosol. During the transport process, Arf-boiind GTP is hydrolyzed to GDP 
and the coat dissociates from the transport vesicle (West, MA. et al. (1997) J. Cell Biol. 138:1239- 
1254). 

Coat protein (COP) coats form on flie ER and Golgi. COP coats can further be distinguished 
as COPI, involved in retrograde traffic through the Golgi to the ER, and COPII, involved in 

10 anterograde traffic from the ER to the Golgi. The COP coat consists of two major components, a 
GTP-binding protein (Arf or Sar) and coat protomer (coatomer). Coatomer is an equimolar complex . 
of seven proteins, termed alpha-, beta-, beta'-, ganama-, delta-, epsilon- and zeta-COP. The coatomer 
con^ylex binds to dilysine motifs contained on the cytoplasmic tails of integral membrane proteins. 
These include the dilysine-containing retrieval motif of membrane proteins of the ER and 

15 dibasic/diphenylamine motifs of members of the p24 family. The p24 family of type I membrane 
proteins represent the major membrane proteins of COPI vesicles (Barter, C. and RT. Wieland (1998) 
Proc. Natl. Acad. Sci. USA 95:11649-11654). 

Vesicles can undergo homotypic or heterotypic fusion. Molecules required for appropriate 
targeting and fusion of vesicles include proteins in the vesicle membrane, the target membrane, and 

20 proteins recruited from the cytosol. During budding of the vesicle from the donor con5)artment, an 
mtegral membrane protein, VAMP (vesicle-associated membrane protein) is incorporated into the 
vesicle. Soon after the vesicle uncoats, a cytosolic prenylated GTP-binding protein, Rab, is inserted 
into the vesicle membrane. The amino acid sequence of Rab proteins reveals conserved GTP-binding 
domains characteristic of Ras superfamily members. In the vesicle membrane, GTP-bound Rab 

25 interacts with VAMP. Once the vesicle reaches the target membrane, a GTPase activating protein 
(GAP) in the target membrane converts the Rab protein to the GDP-bound fonn. A cytosolic protein, 
guanine-nucleotide dissociation inhibitor (GDI) then removes GDP-bound Rab from the vesicle 
membrane. Several Rab isoforms have been identified and appear to associate with specific 
compartments within the cell. For example, Rabs 4, 5, and 1 1 are associated with the early 

30 endosome, whereas Rabs 7 and 9 associate with the late endosome. These differences may provide 
selectivity in the association between vesicles and their target membranes (Novick, P. and M. Zerial 
(1997) Cur. Opin. CeU Biol. 9:496-504). 

Docking of the transport vesicle with the target membrane involves the formation of a 
complex between the vesicle SNAP receptor (v-SNARE), target membrane (t-) SNAREs, and certain 

35 other membrane and cytosolic proteins. Many of these other proteins have been identified although 
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their exact functions in the docking complex remain micertain (Tellam, J.T. et al. (1995) J. Biol. 
Chem. 270:5857-5863; Hata, Y. and T.C. Sudhof (1995) J. BioL Chem. 270:13022-13028). 
N-ethylmaleimide sensitive factor (NSF) and soluble NSF-attachment protein (a-SNAP and p-SNAP) 
are two such proteins that are conserved from yeast to man and function in most intracellular 

5 membrane fusion reactions. Seel represents a family of yeast proteins that function at many different 
stages in the secretory pathway including membrane fusion. Recenfly, mammalian homologs of Seel, 
called Munc-18 proteins, have been identified (Katagiri, H. et al. (1995) J. BioL CheuL 
270:4963-4966; Hata et al. supraV 

The SNARE complex involves three SNARE molecules, one in the vesicular menibrane and 

10 two in the target membrane. Together they form a rod-shaped complex of four a-helical coiled-coils. 
The membrane anchoring domains of all three SNAREs project from one end of the rod. This 
complex is similar to the rod-like structures formed by fusion proteins characteristic of the enveloped 
viruses, such as myxovirus, influenza, filovirus (Ebola), and the HIV and SIV retroviruses. (Skehel, 
J J. and D.C. Wiley (1998) CeU 95:871-874). It has been proposed that the SNARE complex is 

15 sufficient for membrane fusion, suggesting that the proteins which associate with the complex 

provide regulation over the fusion event (Weber, T. et al. (1998) Cell 92:759-772). For example, in 
neurons, which exhibit regulated exocytosis, docked vesicles do not fuse with the presynaptic 
membrane until depolarization, which leads to an influx of calcium (Bennett, M.K. and R.H. Scheller 
(1994) Annu. Rev. Biochem. 63:63-100). Synaptotagmin, an integral membrane protem in the 

20 synaptic vesicle, associates with the t-SNARE syntaxin in the docking complex. Synaptotagmin 

binds calcium in a complex with negatively charged phospholipids, which allows the cytosolic SNAP 
protein to displace synaptotagmin from syntaxin and fusion to occur. Thus, synaptotagmin is a 
negative regulator of fusion in the neuron (Littleton, JT. et al. (1993) Cell 74:1125-1134). The most 
abundant membrane protein of synaptic vesicles appears to be the glycoprotein synaptophysin,, a 38 

25 kDa protein with four transmembrane domains. Although the function of synaptophysin is not 
known, its calcium-binding ability, tyrosine phosphorylation, and widespread distribution in neural 
tissues suggest a potential role in neurosecretion (Bennett, supra). 

The transport of proteins into and out of vesicles relies on interactions between cell 
membranes and a supporting membrane cytoskeleton consisting of spectrin and other proteins. A 

30 large family of related proteins called ankyrins participate in the transport process by binding to the 
membrane skeleton proteia spectrin and to a protein in the cell membrane called band 3, a component 
of an anion channel in the cell membrane. Ankyrins therefore function as a critical link between the 
cytoskeleton and the cell membrane. 

Originally found in association with erythroid cells, ankyrins are also found in other tissues 

35 as weU (Birkenmeier, C.S. et al. (1993) J. Biol. Chem. 268:9533-9540). Ankyrins are large proteins 
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(-1800 amino acids) containing an N-terminal, 89 kDa domain that binds the cell membrane proteins 
band 3 and tubulin, a central 62 kDa domain that binds the cytoskeletal proteins spectrin and 
vimentin, and a C-terminal, 55 kDa regulatory domain that functions as a modifier of the binding 
activities of the other two domains. Individual genes for ankyrin are able to produce multiple ankyrin 

5 isoforms by various insertions and deletions. These isofonns are of nearly identical size but may 
have different functions. In addition, smaller transcripts are produced which are missing large 
regions of the coding sequences from the N-temodnal (band 3 binding), and central (spectrin bindmg) 
domains. The existence of such a large family of ankyrin proteins and the observation that more than 
one type of ankyrin may be expressed in the same cell type suggests that ankyrins may have more 

10 specialized functions than simply bindmg the membrane skeleton to the plasma membrane 
(Birkenmeier, supra) . 

In humans, two isoforms of ankyrin are expressed, alternatively, in developing erythroids and 
mature erythroids, respectively (Lambert, S. et. al. (1990) Proc. Natl. Acad. Sci. USA 87:1730-1734). 
A deficiency in eiythroid spectrin and ankyrin has been associated with the hemolytic anemia, 

15 hereditary spherocytosis (Coetzer, T,L. et al. (1988) New Engl. J. Med. 318:230-234). 

Correct trafficking of proteins is of particular importance for the proper function of epithelial 
cells, which are polarized into distinct apical and basolateral domains containing different cell 
membrane components such as lipids and membrane-associated proteins. Certain proteins are 
flexible and may be sorted to the basolateral or apical side depending upon cell type or growth 

20 conditions. For example, the kidney anion exchanger (kAEl) can be retargeted from the apical to the 
basolateral domain if cells are plated at higher density. The protein kanadaptin was isolated as a 
protein which binds to the cytoplasmic domain of kAEl . It also colocalizes with kAEl in vesicles, 
but not in the membrane, suggesting that kanadaptin's function is to guide kAEl-containing vesicles 
to the basolateral target membrane (Chen, J. et al. (1998) J. Biol. Chem. 273:1038-1043). 

25 Vesicle trafficking is crucial m the process of neurotransmission. Synaptic vesicles cany . 

neurotransmitter molecules from the cytoplasm of a neuron to the synapse. Rab3's are a family of 
GTP-binding proteins located on synaptic vesicles. The RIM family of proteins are thought to be 
effectors for Rab3*s (Wang, Y. et al. (2000) J. Biol. Chem. 275:20033-20044). Rabphilin-3 is a 
synaptic vesicle protein. Granuphilins are proteins with homology to rabphilins, and may have a 

30 unique role in exocytosis (Wang, J. et al. (1999) J. Biol. Chem. 274:28542-28548). 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
trafficking of proteins to organelles or the cell surface. Defects in the trafficking of membrane-bound 
receptors and ion channels are associated with cystic fibrosis (cystic fibrosis transmembrane 
conductance regulator; CFTR), glucose-galactose malabsorption syndrome (NaVglucose 

35 cotransporter), hypercholesterolemia Qow-density lipoprotein (LDL) receptor), and forms of diabetes 

5 
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meUitus (insulin receptor). Abnonnal hormonal secretion is linked to disorders including diabetes 
insipidus (vasopressin), hyper- and hypoglycemia (insulin, glucagon). Grave's disease and goiter 
(thyroid hormone), and Cushing's and Addison's diseases (adrenocorticotropic hormone; ACTH). 
Cancer cells secrete excessive amounts of hormones or otiier biologically active peptides. 

5 Disorders related to excessive secretion of biologicaUy active peptides by tumor ceUs include: fasting 
hypoglycemia due to increased insulin secretion from insulinoma-islet ceU tumors; hypertension due 
to increased epinephrine and norepinephrine secreted from pheochroraocytomas of the adrenal 
meduUa and sympathetic paraganglia; and carcinoid syndrome, which includes abdominal cramps, 
diarrhea, and valvular heart disease, caused by excessive amounts of vasoactive substances 

10 (serotonin, bradykdnin, histamine, prostaglandins, and polypeptide hormones) secreted from intestinal 
tumors. Ectopic synthesis and secretion of biologically active peptides (peptides not expected from a 
tumor) includes ACTH and vasopressin in lung and pancreatic cancers; parathyroid hormone in hmg 
and bladder cancers; calcitonin in lung and breast cancers; and thyroid-stimulating hormone in 

medullary tiiyroid carcinoma. 
15 Various human pathogens alter host ceU protein trafficking pathways to their own advantage. 

For example, the HIV protein Nef downregulates cell-surface expression of CD4 molecules by 
accelerating their endocytosis through clathrin coated pits. This function of Nef is important for the 
. spread of HIV from the infected cell (Harris, M. (1999) Curr. Biol. 9:R449-R461). A lecentiy 
identified human protein, Nef-associated factor 1 (Nafl), a protein with four extended coiled-coil 
20 domains, has been found to associate with Nef. Overexpression of Nafl increased ceU surface 

expression of CD4, an effect which could be suppressed by Nef (Fukushi. M. et al. (1999) FEBS Lett. 
442:83-88). 

The discovery of new vesicle trafficking proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis. 
25 prevention, and treatment of vesicle trafficking disorders, autoimmune/inflammatory disorders, and 
cancer, and in the assessment of the effects of exogenous compounds on tiie expression of nucleic 
acid and amino acid sequences of vesicle trafficking proteins. 

SUMMARY OF THE INVENTION 
30 The invention features purified polypeptides, vesicle trafficking proteins, referred to 

collectively as "VETRF' and individually as "VETRP-l," "VETRP-2," "VETRP-S." ''VErRP-4," 
"VETRP-S," "VETRP-e." "VETRP-?," "VETRP-8," "VETRP-9," "VETRP-10." "VETRP-11," 
«VFrRP-12," "VETRP-13," "VETRP-I4," "VETRP-IS," "VETRP-16," "VETRP-17," "VETRP-18," 
"VErRP-19," "VErRP-20," "VETRP-ll," "VETRP-22," and "VETRP-23." In one aspect, die 
35 invention provides an isolated polypeptide comprising an amino acid sequence selected from the 
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group consisting of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1- 
23, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-23, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-23, and d) 

5 an immunogenic fragment of an ammo acid sequence selected from the group consisting of SEQ ID 
NO: 1-23. In one alternative, the invention provides an isolated polypeptide comprising the amino 
acid sequence of SEQ ID NO: 1-23. 

The invention ftuther provides an isolated polynucleotide encoding a polypeptide comprising 
an amino acid sequence selected from the group consisting of a) an amino acid sequence selected 

10 from the group consisting of SEQ ID NO:l-23, b) a naturally occurring amino acid sequence having 
at least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ 
ID NO; 1-23, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-23, and d) an immunogenic fragment of an amino add sequence selected 
from the group consisting of SEQ ID NO: 1-23. In one alternative, the polynucleotide encodes a 

15 polypeptide selected from the group consistmg of SEQ ID NO: 1-23. In another alternative, the 
polynucleotide is selected from the group consisting of SEQ ID NO:24-46. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 

20 consisting of SEQ ID NO:l-23, b) a naturally occurring ammo acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-23, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-23, and d) an inmmnogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-23. In one alternative, the invention provides a cell transformed 

25 with the recombinant polynucleotide. In another alternative, the invention provides a transgenic 
organism comprising the recombinant polynucleotide. 

The mvention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consistmg of SEQ ID NO: 1-23, b) a naturally occurring amino acid sequence having at least 90% 

30 sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-23, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-23, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-23. The method comprises a) culturing a cell under conditions 
suitable for expression of the polypeptide, wherem said cell is transformed with a recombinant 

35 polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the 
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polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-23, b) anatorally occurring amino 
5 acid sequence having at least 90% sequence identity to an amino acid sequence selected from tiie 
group consisting of SEQ ID NO: 1-23, c) a biologically active fragment of an amino acid sequence 
selected from flie group consisting of SEQ ID NO: 1-23. and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-23. 

The invention further provides an isolated polynucleotide comprising a polynucleotide 
10 sequence selected from tiie group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:24-46, b) a naturally occurring polynucleotide sequence having at least 
70% sequence identity to a polynucleotide sequence selected from die group consisting of SEQ ID 
NO:24-46, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
complementary to b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide 
15 comprises at least 60 contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide having a sequence of a polynucleotide conq>rising a polynucleotide 
sequence selected from die group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:24-46, b) a naturally occurring polynucleotide sequence having at least 
20 70% sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:24-46, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
complementary to b). and e) an RNA equivalent of a)-d). The metiiod comprises a) hybridizing die 
sample witii a probe comprising at least 20 contiguous nucleotides comprising a sequence 
complementary to said target polynucleotide in tiie sample, and which probe specificaUy hybridizes to 
25 said target polynucleotide, under conditions whereby a hybridization con^lex is formed between said 
probe and said target polynucleotide or fragments tfiereof, and b) detecting the presence or absence of 
said hybridization conq>lex, and optionally, if present, the amount thereof. In one alternative, tiie 
probe comprises at least 60 contiguous nucleotides. 

The invention furtiier provides a metiiod for detecting a target polynucleotide in a sample, 
30 said target polynucleotide havmg a sequence of a polynucleotide conq>rising a polynucleotide 

sequence selected from the group consisting of a) a polynucleotide sequence selected from flie group 
consisting of SEQ ID NO:24-46, b) a naturally occurring polynucleotide sequence having at least 
70% sequence identity to a polynucleotide sequence selected from tiie group consisting of SEQ ID 
N0:2446, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence 
35 complementary to b), and e) an RNA equivalent of a)-d). The metiiod comprises a) amplifying said 
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target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, 
optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a 

5 polypeptide comprising an anodno acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-23, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-23, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-23, and d) an immunogenic fragment of an ammo 

10 acid sequence selected from the group consisting of SEQ ID NO: 1-23, and a pharmaceuticaUy 
acceptable excipient. In one embodiment, the composition comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-23. The invention additionaUy provides a 
method of treating a disease or condition associated with decreased expression of fimctional VETRP, 
comprising administering to a patient in need of such treatment the composition. 

15 The invention also provides a method for screening a compound for effectiveness as an 

agonist of a polypeptide comprismg an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO;l-23, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-23, c) a biologicaUy active fragment of an amino 

20 acid sequence selected from the group consistmg of SEQ ID NO: 1-23, and d) an unmrmogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-23. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
agonist activity in the sample. In one alternative, the invention provides a composition comprising an 
agonist compound identified by the method and a pharmaceutically acceptable excipient. In another 

25 alternative, the invention provides a method of treating a disease or condition associated with 
decreased expression of functional VETRP, comprising administering to a patient in need of such 
treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 

30 of a) an amino acid sequence selected from the group consisting of SEQ ID NO; 1-23, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-23, c) a biologically active fragment of an amino 
acid sequence selected from die group consisting of SEQ ID NO: 1-23, and d) an immunogenic 
fragment of an ammo acid sequence selected from the group consisting of SEQ ID NO:l-23. The 
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method comprises a) exposing a sample comprising the polypeptide to a compomid, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a composition comprising 
an antagonist compound identified by the method and a pharmaceutically acceptable excipient In 
another alternative, the invention provides a method of treating a disease or condition associated with 
5 overexpression of functional VETRP, comprising administering to a patient in need of such treatment 
the composition. 

The invention fiirther provides a method of screening for a compound that specifically binds 
to a polypeptide con5)rising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-23, b) a naturally occurring 
10 amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 
tiie group consisting of SEQ ID NO:l-23. c) a biologically active fiagment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-23, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-23. The method comprises a) 
combining the polypeptide with at least one test compound under suitable conditions, and b) 
15 detecting binding of the polypeptide to tiie test compound, tiiereby identifying a compound that 
specifically binds to the polypeptide. 

The invention fiirther provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected fix)m the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO:l-23, b) a naturaUy 
20 occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

selected ftom the group consisting of SEQ ID NO:l-23, c) a biologically active fragment of an amino 
acid sequence selected fixjm the group consisting of SEQ ID NO:l-23, and d) an immunogenic 
fragment of an amino acid sequence selected bom the group consisting of SEQ ID NO:l-23. The 
mefliod comprises a) combining the polypeptide witii at least one test compound under conditions 
25 permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 

presence of the test compound, and c) comparing tiie activity of die polypeptide in the presence of tiie 
test compound with tiie activity of die polypeptide in tiie absence of the test compound, wherein a 
change in tiie activity of tiie polypeptide in flie presence of tiie test compound is indicative of a 
compound tiiat modulates tiie activity of the polypeptide. 
30 The invention fiirther provides a mefliod for screening a compound for effectiveness in 

altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from die group consisting of SEQ ID NO:24-46, die metfiod comprising a) 
exposing a sample conq)rising die target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 
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The invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizmg the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 

5 group consisting of i) a polynucleotide sequence selected from flie group consisting of SEQ ID 

NO:24-46, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:24-46, iii) a 
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complenaentary to ii), 
and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 

10 hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:24-46, 
ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:24-46, iii) a 

15 polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of iHv). Alternatively, the target polynucleotide comprises a fragment of 
a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amount of hybridization complex in the 
treated biological sample with the amount of hybridization complex in an untreated biological 

20 sample, wherein a difference in the amount of hybridization complex in the treated biological sample 
is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
25 clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding VETRP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of VETRP. 
Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific 
30 expression patterns of each nucleic acid sequence as determined by northem analysis; diseases, 
disorders, or conditions associated with these tissues; and die vector into which each cDNA was 
cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding VETRP were isolated. 
35 Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
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polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 



DESCRIPTION OF THE INVENTION 

5 Before the present proteins, nucleotide sequences, and methods are described, it is understood 

that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which 
will be limited only by the appended claims. 

10 It must be noted that as used herein and in the appended claims, the singular forms "a,** "an " 

and '*the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

15 Unless defined otherwise, all technical and scientific terms used herein have the same 

meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describmg and disclosing 

20 the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

'*VETRP' refers to the amino acid sequences of substantially purified VETRP obtained from 
25 any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant 

The term "agonisf refers to a molecule which intensifies or mimics the biological activity of 
VETRP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
con^)Ound or composition which modulates the activity of VETRP either by directly interacting with 
30 VETRP or by acting on components of the biological pathway in which VETRP participates. 

An "allelic variant" is an alternative form of the gene encoding VETRP. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or ftmction may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
35 allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
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Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding VETRP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as VETRP or 
a polypeptide with at least one functional characteristic of VETRP. Included within this definition 
are polymorphisms which may or may not be readUy detectable using a particular oligonucleotide 
piobe of the polynucleotide encoding VETRP, and improper or unexpected hybridization to alleUc 
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding VETRP. Hie encoded protein may also be "altered," and may contain deletions, insertions, 
or substitutions of amino acid residues which produce a sUent change and result in a functionally 
equivalent VETRP. Deliberate amino acid substitutions may be made on flie basis of simflarity in 
polarity, charge, solubiUty, hydrophobicity. hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of VETRP is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and argmine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleudne. and valme; glycme and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "ammo acid sequence" is recited to refer to a sequence of a naturaUy occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PGR) technologies well 
known in the art 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of VETRP. Antagonists may include proteins such as antibodies, nucleic acids, caibohydrates, small 
molecules, or any other compound or conq)Osition which modulates the activity of VETRP either by 
directly interacting with VETRP or by actmg on components of the biological pathway in which 
VETRP participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind VETRP polypeptides can be prepared using intact polypeptides or using 
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
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oUgopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemicaUy, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
5 the animal. 

The term "antigenic detenninant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. Whoi a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specificaUy to antigenic determinants (particular regions or three-dimensional stmctares 
10 on die protein). An antigenic detenninant may compete with the intact antigen (i.e., the immunogen 
used to elicit the inomune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; 
RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
15 phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
. modified bases such as 5-niethylcytosine,2'-deoxyuracil. or 7-<iea2a-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a ceU, the complementary antisense molecule base-pairs wifli a naturally occurring 
20 nucleic acid sequence produced by the ceU to form duplexes which block eiflier transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "pias" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturaDy occurring molecule. Likewise, "immunologically active" or "immunogenic" 
25 refers to die capabiHty of die natural, recombinant, or syndietic VETRP, or of any oligopeptide 

dieieof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Con5)lementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs vsdth its complement, 
30 y-TCA-5*. 

A "con^osition con5)rising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any con?)Osition containing the given polynucleotide 
or amino acid sequence. The composition may comprise a diy formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding VETRP or fragments of VETRP may 
35 be employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
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associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed m an aqueous solution containing salts (e.g., NaQ), detergents (e.g., sodium dodecyl 
sulfate; SDS), and oAer components (e.g., Denhardt's solution, dry milk, sahmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 

5 DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5* and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 
program for fragment assembly, such as the GELVEW fragment assembly system (GCG, Madison 
WI) or Phrap (University of Washington, Seattle WA). Some sequences have been both extended and 

10 assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protem, i.e., the structure and especiaUy die function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
amino adds which may be substituted for an original amino acid in a protem and which are regarded 

15 as conservative amino acid substitutions. 

Conservative Substitution 





Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gbi, His 


20 


Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gto 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 


25 


His 


Asn, Arg, Ghi, Glu 


De 


Leu, Val 




Leu 


He, Val 




Lys 


Arg, Gto, Glu 




Met 


Leu, He 


30 


Phe 


His, Met, Leu, Trp, Tyr 


Ser 


Cys, Thr 




Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 


35 


Val 


He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for exan()le, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
40 the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
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absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide sequence can include, for example, replacement of 
hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a 
5 polypeptide which retains at least one biological or immunological function of the natural molecule. 
A derivative polypeptide is one modified by glycosylation, pegylation, or any similar process that 
retains at least one biological or immunological function of the polypeptide from which it was 
derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
10 measurable signal and is covalently or noncovalendy joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of VETRP or the polynucleotide encoding VETRP which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire lengtfi of the defined sequence, minus one nucleotide/amino acid residue. For example, 
a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
15 used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 
15, 16, 20. 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in lengtii. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain lengtii of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
20 defined sequence. Qearly tiiese lengtiis are exemplary, and any lengtii tiiat is supported by tiie 

specification, mcluding die Sequence Listing, tables, and figures, may be encompassed by die present 
embodiments. 

A fragment of SEQ ID NO:24-46 comprises a region of unique polynucleotide sequence diat 
specifically identifies SEQ ID NO:24-46, for example, as distinct fi^m any oflier sequence in tiie 

25 genome from which die fragment was obtained. A fragment of SEQ ID NO:24-46 is usefiil, for 
example, in hybridization and amplification technologies and in analogous metiiods tiiat distinguish 
SEQ ID NO:24-46 from related polynucleotide sequences. The precise lengtii of a fragment of SEQ 
ID NO:24-46 and tiie region of SEQ ID NO:24-46 to which tiie fragment corresponds are routinely 
determinable by one of ordinary skill in die art based on die intended purpose for die fragment. 

30 A fragment of SEQ ID NO:l-23 is encoded by a fragment of SEQ ID NO:24-46. A firagment 

of SEQ ID NO: 1-23 comprises a region of unique amino acid sequence tiiat specifically identifies 
SEQ ID NO: 1-23. For example, a fragment of SEQ ID NO:l-23 is usefiil as an immunogenip peptide 
for die development of antibodies tiiat specifically recognize SEQ ID NO:l-23. The precise lengtii of 
a fragment of SEQ ID NO: 1-23 and die region of SEQ ID NO: 1-23 to which die firagment 

35 corresponds are routinely determinable by one of ordinary skill in tiie art based on die intended 
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purpose for the fragment. 

A "fuU-length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g.. methionine) followed by an open reading frame and a translation termination codon. A 
"full-length" polynucleotide sequence encodes a "fiiU-length" polypeptide sequence. 
5 -Homology" refers to sequence similarity or, interchangeably, sequence identity, between 

two or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, m a standardized and reproducible way, gaps 
10 in the sequences being compared in order to optimize alignment between two sequences, and 
tiierefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGAUGN version 3.12e 
sequence aUgmnent program. This program is part of the LASERGENE software package, a suite of 
15 molecularbiologicalanalysisprograms(DNASTAR,MadisonWD. CLUSTAL V is described in 
Higgins, D.G. and PM. Sharp (1989) CABIOS 5:151-153 and in Higgins. D.G. et al. (1992) CABIOS 
8:189-191. For pairwise aUgnments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
20 simflarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely avaUable sequence comparison algorithms 
is provided by tiie National Center for Biotechnology Information (NCBD Basic Local Aligmnent 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403^10). which is available 
from several sources, including the NCBI, Bethesda, MD, and on tiie hitemet at 
. 25 http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," tiiat is used to aUgn a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlnLnih.gov/gor&bl2.html. 
30 The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used witii gap and oflier parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn witii the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 
35 Reward for tnatch: 1 
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Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap X drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken fiom a larger, defined sequence, for instance, a fiagment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100. or at least 200 contiguous 
nucleotides. Such lengths are exenq)lary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aUgned using a 
standardized algorithm. Methods of polypeptide sequence alignment are weU-known. Some 
aUgnment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structore (and therefore fimction) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence aUgnment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as foUows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide aUgnments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aUgned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (Apr-21-2000) with blastp set at default parameters. Such default parameters may be, for 

example: 

Matrix: BLOSUM62 
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Open Gap: 11 and Extension Gap: 1 penalties 

Gap X drop-off: 50 

Expect: 10 

WordSizje:3 

Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40. at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be naeasured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for • 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen bmding regions has been altered so that the antibody more closely 
resembles a human antibody, and stiU retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals vnth a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands diat are not perfectiy matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skiU 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in flie presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 pg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
S-X: to 2(rc lower than the thermal melting point (T J for the specific sequence at a defined ionic 
strength and pH. The T„ is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectiy matched probe. An equation for calculating T„ and 
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conditions for nucleic acid hybridization are weU known and can be found in Sambrook, J. et al., 
1989, Molecular Qon in p: A Laboratory Manual. ed., vol. 1-3, Cold Spring Harbor Press, 
Plainview NY; specifically see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present 
invention include wash conditions of 68*^0 in the presence of about 0.2 x SSC and about 0.1% SDS, 
for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
Typically, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 ^ig/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular 
circumstances, such as for RNA:DNA hybridizations. Usefiil variations on fliese wash conditions 
will be readily apparent to those of ordinary skill in the art. Hybridization, particularly under high 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
similarity is strongly indicative of a sunilar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
.hybridization complex may be formed in solution (e.g., Cot or Rot analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence inmiobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words "msertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, inomune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "unmunogenic fragment" is a polypeptide or oligopeptide fragment of VFTRP which is 
capable of elicitmg an immune response when introduced into a living organism, for exan^le, a 
mammal. The term "immunogenic fragment also mcludes any polypeptide or oUgopeptide fragment 
of VETRP which is useful in any of the antibody production methods disclosed herein or known in 
the art 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "airay element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
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The tem "modulate" refers to a change in the activity of VETRP. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biolopcal, functional, or immunological properties of VETRP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
5 polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
syntiietic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-Iike material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
10 linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and. where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agpnt which 
comprises an oUgonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
15 amino acid residues ending in lysine. The terminal lysine confers solubUity to the composition. 
PNAs preferentiaUy bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

'Tost-translational modification" of an VETRPmay involve lipidation. glycosylation, 
phosphorylation, acetylation. racemization. proteolytic cleavage, and other modifications known in 
20 theart These processes may occur syntiietically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of VETRP. 

•TProbe" refers to nucleic acid sequences encoding VETRP, tiieir complements, or fragments 
thereof, which are used to detect identical, alleUc or related nucleic acid sequences. Probes are 
isolated oUgonucleotides or polynucleotides attached to adetectable label or reporter molecule. 
25 Typical labels include radioactive isotopes, ligands, chemUuminescent agents, and enzymes. 

"Primers" are short nucleic acids. usuaUy DNA oligonucleotides, which may be amiealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along die target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of anuclac acid sequence, e.g.. by the polymerase chain reaction (PCR). 
30 Probes and primers as used in the present invention typicaUy comprise at least 15 contiguous 

nucleotides of aknown sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers tiiat comprise at least 20, 25, 30, 40. 50. 60. 70. 80. 90. 100. 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer tiian fliese examples, and it is understood diat any length supported by die 
35 specification, including die tables, figures, and Sequence Listing, may be used. 
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Methods for preparing and using probes and primers are described in the references, for 
example Sambrook. J. et al. (1989) Molmilar Cloning: A T ^horatory Manual, 2- ed., vol. 1-3. Cold 
Spring Harbor Press, Plainview NY; Ausubel, FJVl. et al. ( 1987) r.irr.pt Protocols in Molecular 
Biology. Greene Publ. Assoc. & Wiley-lntersdences. New York NY; bnis, M. et al. (1990) 
5 p.».^.i. A Gnide to i^^thnH...dAnnlications. Academic Press. San Diego CA. PCR primer pairs 
can be derived from a knovvn sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5. 1991. Whitehead Institote for Biomedical Research. Cambridge 
MA). 

OHgpnucleotides for use as primers are selected using software known in the art for such 
10 purpose. Forexample.OUG04.06softwareisuseMfortheselectionofPCRprimerpairsofupto 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5.000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection progiams have incorporated additional features for expanded c^^abiUties. For example, the 
PriraOU primer selection program (available to the pubUc from the Genome Center at University of 
15 Texas South West Medical Center. Dallas TX) is capable of choosing specific primers from 

megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
primer selection program (available to the pubUc from the Whitehead tostituteMTr Center for 
Genome Research. Cambridge MA) allows the user to input a "mispriming library." in which 
sequences to avoid as primer bmding sites are user-specified. Primer3 is useful, in particular, for the 
20 selectionofoUgonucleotidesformicroarrays. (The source code for the latter two primer selection 
programs may also be obtained from their respective sources and modified to meet the user' s specific 
needs.) The PrimeGen program (available to the public from the UK Human Genome Mapping 
Project Resource Centre. Cambridge UK) designs primers based on multiple sequence aUgnments. 
thereby allowing selection of primers that hybridize to either the most conserved or least conserved 
25 regionsofalignednucleicacidsequences. Hence, this program is useful for identification of both 
unique and conserved oUgonucleotides and polynucleotide firagments. The oligonucleotides and 
polynucleotide fragments identified by any of the above selection methods are useful in hybridization 
technologies, for example, as PCR or sequencing primers, microarray elements, or specific probes to 
identify fully or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
30 oligonucleotide selection are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
fliat is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accompUshed by chemical synthesis or. more commonly, by the 
artificial manipulation of isolated segments of nucleic acids. e.g.. by genetic engineering techniques 
35 suchasthosedescribedinSambrook,suEm- The term recombinant includes nucleic adds that have 
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been altered solely by addition, substitution, or deletion of aportion of the nucleic acid. Frequenfly. a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic add may be part of a vector that is used, for example, to 

transform a cell. 

5 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombmant nucleic acid is 
expressed, inducing a protective immunological response in die mammal. 

A "regulatory elemenf ' refers to a nucleic add sequence usually derived from untranslated 
regions of a gpne and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 

10 (UTRs). Regulatory elements interact widi host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic add, 
amino acid, or antibody. Reporter molecules include radionncUdes; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cefaclors; inhibitors; magnetic particles; and . 

15 odier moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that aU occurrences of Uie 
nitrogpnous base thymine are replaced with uradl. and the sugar backbone is composed of ribose 
instead of deoxyribose. 

20 The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 

adds encoding VETRP, or fragments thereof, or VETRP itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a ceU; genomic DNA, 
RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically bindinr refer to that interaction between a 
25 protein or peptide and an agonist, an antibody, an antagonist, a smaD molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon die presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. Forexanq)le,if an antibody is specific for epitope "A," die presence of a polypeptide 
comprising the epitope A, or die presence of free unlabeled A, in a reaction containing free labeled A 
30 and flie antibody will reduce die amount of labeled A diat binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino add sequences that are 
removed from tiieir natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from otiier components witi» which 
they are naturally associated. 
35 A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
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by different amino add residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nomnagnetic beads, gels, tubing, plates, polymers, 
mcroparticles and capillaries. The substrate can have a variety of surface forms, such as weUs. 
trenches, pins, chamiels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refeiti to the coUective pattern of gene expression by a particular cell 
type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
, weU known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host ceU. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viralinfection.electroponition,heatshock.Upofection.andparticlebombardment. The term 

"transformed" cells includes stably transformed cells in which the inserted DNA is capable of 
5 repUcation either as an autonomously repUcating plasmid or as part of the host chromosome, as well 
as transiendy transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism." as used herein, is any organism, including but notlimited to 
animals and plants, in which one or more of the cells of .he organism contains heterologous nucleic 
add introduced by way of human intervention, such as by transgenic techniques well known in the 
20 art. Thenucleicacidisintroducedintothecell.directlyorindirecaybyintroductionintoaprecursor 
of the ceU, by way of deliberate genetic manipulation, such as by micromjection or by infection witii 
a recombmant vims. The term genetic manipulation does not include classical cross-breeding, or in 
vifio fertilization, but ratiier is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accorfance witi. the present invention include bacteria, 
25 cyanobacteria.fungi,plants.andanimals.TheisolatedDNAofthepresentinventioncanbe 

introduced into the host by methods known in the art, for example infection, transfection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook, J. et al. (1989), 

supra . 

30 A •Wane- of a particular nucleic add sequence is defined as a nucleic acid sequence having 

at least 40% sequence identity to the particular nucleic add sequence over a certain length of one of 
the nucleic acid sequences using blastii with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%. at least 95%.or at least 98% or 

35 greater sequence identity over a certain defined length. A variant may be described as, for example. 
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an "alleUc" (as defined above), "splice." "species." or "polynrorphic" variant A splice variant may 
have significant identity to a reference molecule, but vdll generally have a greater or lesser number of 
polynucleotides due to alternative spUcing of exons during mRNA processing. Tbc corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 

5 reference molecule. Species variants are polynucleotide sequences that vary fix)m one species to 
another. The resulting polypeptides generaUy will have significant amino acid identity relative to 
each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 

10 presence of SNPs may be indicative of. for example, a certain population, a disease state, or a 

propensity for a disease state. 

A -Variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp wifli the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
15 1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%. at 
least 60%. at least 70%. at least 80%. at least 90%. at least 95%. or at least 98% or greater sequence 

identity over a certain defined lengA of one of the polypeptides. 
THE INVENTION 

The invention is based on the discovery of new human vesicle trafficking proteins (VETRP), 
20 the polynucleotides encoding VETRP, and the use of these compositions for the diagnosis, treatment, 
or prevention of vesicle trafficking disorders, autoimmune/inflammatory disorders, and cancer. 

Table 1 Usts die Incyte clones used to assemble fiiU length nucleotide sequences encoding 
VETRP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the 
polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte 
25 clones in which nucleic acids encoding each VETRP were identified, and column 4 shows the cDNA 
Ubraries from which these clones were isolated. Column 5 shows Incyte clones and their 
corresponding cDNA Ubraries. Clones for which cDNA Ubraries are not indicated were derived from 
pooled cDNA libraries. In some cases. GenBank sequence identifiers are also shown in column 5. 
The Incyte clones and GenBank cDNA sequences, where indicated, in column 5 were used to 
30 assemble the consensus nucleotide sequence of each VETRP and are usefiil as fragments in 
hybridization technolo^es. 

The cotamns of Table 2 show various properties of each of the polypeptides of the invention: 
cotamn 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; colmnn 4 shows potential glycosylation 
35 sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
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shows homologous sequences as identified by BLAST analysis along with relevant citations, all of 
which are expressly incorporated by reference herein in their entirety; and column 7 shows analytical 
methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 
5 protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding VETRP. The first column of Table 3 lists the 
nucleotide SEQ ID NOs. Cohimn 2 lists fragments of the nucleotide sequences of cohimnl. These 
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID 

10 NO:24^ and to distinguish between SEQ ID NO:24^ and related polynucleotide sequences. Hie 
polypeptides encoded by tiiese fragments are useM, for example, as immunogenic peptides. Column 
3 lists tissue categories which express VETRP as a fraction of total tissues expressing VETRP. 
Column 4 lists diseases, disorders, or conditions associated with tiiose tissues expressing VETRP as a 
fraction of total tissues expressing VETRP. Column 5 lists die vectors used to subclone each cDNA 

15 Ubrary. Of particular note is tiie expression of SEQ ID NO:25 in nervous tissue. SEQIDNO:41is 
noted for its expression in both cancer and reproductive tissue, and SEQ ID NO:43 is expressed in 

cancer and nervous tissue. 

The cotarons of Table 4 show descriptions of die tissues used to construct tiie cDNA libraries 
from which cDNA clones encoding VETRP were isolated. Column 1 references tiie nucleotide SEQ 
20 ID NOs, column 2 shows the cDNA libraries from which fliese clones were isolatedv and column 3 
shows die tissue origins and otiier descriptive information relevant to die cDNA libraries in column 2. 

The invention also encompasses VETRP variants. A preferred VETRP variant is one which 
has at least about 80%. or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to die VETRP amino acid sequence, and which contams at least one functional or 
25 stractural characteristic of VETRP. 

SEQ ID NO:31 maps to chromosome 12 within Uie interval from 70.60 to 76.50 
centiMorgans, and to chromosome 1 within die interval from 159.60 to 164.10 centiMorgans. SEQ 
ID NO:36 maps to chromosome 3 witiiin tiie interval from 129.00 to 131.80 centiMorgans. and to 
chromosome 4 widun die interval from 86.00 to 91.90 centiMorgans. SEQ ID NO:38 maps to 
30 chromosome 6 within die interval from die p-terminus to 27.10 centiMorgans. SEQ ID NO:42 maps 
to chromosome 2 wifliin die interval from 233.10 to 236.10 centiMorgans. SEQ ID NO:44 maps to 
chromosome 5 widiin die interval from 61.10 to 69.60 centiMorgans, to chromosome 11 widun die 
interval from 1 17.90 to 123.50 centiMorgans, and to chromosome 17 widiin flie interval from 99.30 
to 103.70 centiMorgans. 

35 The invention also encompasses polynucleotides which encode VETRP. In a particular 
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embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:2446, which encodes VETRP. The polynucleotide 
sequences of SEQ ID NO:2446. as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracU, and the 
sugar backbone is con^osed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding VETRP. In 
particular, such a variant polynucleotide sequence will have at least about 70%. or alternatively at 
least about 85%. or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding VETRP. A particular aspect of the mvention encompasses a variant of a 
10 polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 
NO:24^ which has at least about 70%, or alternatively at least about 85%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:2446. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of VETRP. 
15 It will be appreciated by those skilled in the art that as a result of the degeneracy of the 

genetic code, a multitude of polynucleotide sequences encoding VETRP. some bearing minimal 
simflarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
20 combinations arc made in accordance with the standard triplet gpnetic code as applied to the 

polynucleotide sequence of naturally occurring VETRP. and all such variations are to be considered 

as being specifically disclosed. 

Although nucleotide sequences which encode VETRP and its variants are generally capable 
of hybridizing to the nucleotide sequence of the naturaUy occurring VETRP under appropriately 
25 selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
VETRP or its derivatives possessing a substantially different codon usage, e.g., inclusion of non- 
natuially occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 
which particular codons are utilized by tiie host. Other reasons for substantially altering the 
30 nucleotide sequence encoding VETRP and its derivatives witiiout altering tiie encoded amino acid 
sequences include tiie production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from flie naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode VETRP and 
VETRP derivatives, or fragments thereof, entirely by syntiietic chemistry. After production, die 
35 synthetic sequence may be inserted into any of the many available expression vectors and cell 
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systems using reagents weU known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding VETRP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO-24^ and fragments thereof under various conditions of stringency. (See, e.g.. Wahl. GM. and 
SL Berger(1987) Methods Enzymol. 152:39^; Kimmel. AJl. (1987) Methods EnzymoL 
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described m 
"Definitions" 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
, theembodimentsoftheinvention. m me*ods may employ such enzymes as the Klenow fragment 
of DNA polymerase I. SEQUENASE (US Biochemical, Cleveland OH). Taq polymerase (Applied 
Biosystems. Foster Qty CA). thermostable T7 polymerase (Amersham Pharmacia Biotech. 
HscatawayNJ). or combinations of polymerases and proofreading exonucleas^ 
in the ELONGASE an^lification system (Life Technologies. Gaithersburg MD). Preferably. 
5 sequence preparation is automated with machines such as the MICROLAB 2200 Uquid transfer 
system (Hamilton. Reno NV). PTC200 thermal cycler (MJ Researeh. Watertown MA) and ABI 
CATALYST 800 thermal cycler (AppUed Biosystems). Sequencing is then carried out usmg either 
the ABI 373 or 377 DNA sequencing system (Applied Biosystems). the MEGAB ACE 1000 DNA 
sequencing system (Molecular Dynamics. Sunnyvale CA). or other systems known in the art. The 
>0 resultingsequencesareanalyzedusingavarietyofalgorithmswhicharewellknownintheart (S^ 

e g Ausubel. FM. (1997) Shortftotocgls inMole^ilarBiolo^ John Wiley & Sons, New York NY, 
unit\.7;Meyers,R.A.(1995)MolecuhLBid^^ 

856-853.) . , , . . 

-ne nucleic acid sequences encoding VETRP may be extended utilizing a part«l nucleotide 
25 sequenceandemployingvariousPCR-basedmethodsknowninthearttodetectupstreams^^^^^ 
such as promoters and regulatory elements. For example, one method which may be employed. 

restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See. e.g.. Sarkar. G. (1993) PCR Methods Applic. 2:318-322.) 
Mother med.od,inversePCR. uses primers that extend in divergent directions to amplify unknot 

30 sequencefromacireularizedtemplate. m template is derived from restriction fragments compnsmg 
a known genomic locus and surrounding sequences. (See. e.g.. Triglia, T. et al. (1988) Nucleic Acids 
Res 16-8186.) A third method, capture PCR, involves PCR amplification of DNA ftagments 
adjacent toknown sequences in human and yeast artificial chromosome DNA. (See. e.g..Lage^^^^ 
M etal (l991)PCRMethods Applic. 1:111-119.) In this method, multiple restriction enzyme 
digestions and Ugations may be used to insert an engineered double-stranded sequence into a region 



35 
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of unknovvn sequence before performmgPCR. Other methods which nmy be used to retrieve 
unknown sequences are known in the art. (See. e.g.. Parker, J J), et al. (1991) Nucleic Acids Res. 
19-3055-3060). Additionally, one ™ay use PCR. nested printers, and PROMOIERFINDER libraries 
(Qontech. Palo Alto CA) to walk genonuc DNA. TOs procedure avoids the need to screen Ubranes 
5 andisusefulinfindingintron/exonjunctions. For aU PCR-based methods, primers may be designed 
using commercially available software, such as OLIGO 4.06 Primer Analysis software (National 
Biosciences. Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides m - 
length, to have a GC content of about 50% or more, and to am>eal to the ten^late at temperatures of 
about 68°C to 72°C. 

10 Whenscreeningforfull-lengthcDNAs,itispreferabletouseUbrariesthathavebeen 
si^s«lected to inctode larger cDNAs. In addition, random-primed Ubraries. which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
UbrarydoesnotyieldafuU-lengthcDNA. Genomic Ubraries may beuseful for extension of sequent 

into 5* non-transcribed regulatory regions. 
15 Capillary electrophoresis systems which are commercially available may be used to analyze 

the size or confirm the nucleotide sequence of sequencing or PGR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleoUde- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Outputmght intensity may be converted to electrical signal using appropriate 
20 software (e.g.. GENCXTYPER and SEQUENCE NAVIGATOR, Applied Biosystems). and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing smaU DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the mvention. polynucleotide sequences or fragments thereof 
25 which encode VBTRP may be cloned in recombinant DNA molecules that direct expression of 

VETRP. or fragments or fimctional equivalents thereof, in appropriate host cells. Due to the inherent 
degeneracy of the g^etic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and used to express VETRP. 

The nucleotide sequences of the present invention can be engineered using methods generally 
30 known in the art in order to alter VEIRP-encoding sequences for a variety of purposes including, but 
not limited to. modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oUgonucleotides may be used to engineer the nucleotide sequences. For example. oUgonucleotide- 
mediated site^ted mutagenesis may be used to introduce mutations that create new restriction 
35 sites, alter glycosylation patterns, change codon preference, produce spUce variants, and so forth. 
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The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 

5 improve the biological properties of VETRP, such as its biological or enzymatic activity or its ability 
to bind to other molecules or con^unds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants vwth the desired 
properties. These preferred variants may then be pooled and furttier subjected to recursive rounds of 

10 DNA shuffling and selection/screening. Thus, genetic diversity is created tfirougji "artificial" 

breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recorabined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 

15 maximizing the genetic diversity of multiple naturally occurring genes in a directed and controUable 
manner. 

In another embodiment, sequences encoding VETRP may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Carathers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 

20 VETRP itself or a fragment thereof may be synthesized using chemical methods. For example, 

peptide synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freenaan, New York NY, pp. 
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the ABI 431A peptide synthesizer (Applied Biosystems). Additionally, the amino acid 

25 sequence of VETRP, or any part thereof, may be altered during direct synthesis and/or combined with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide 
having a sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, RJvf. and F.Z. Regnier (1990) Methods Enzymol. 182:392^21.) 

30 The composition of the synthetic peptides may be confirmed by anuno acid analysis or by 
sequencing. (See, e,g., Creighton, supra, pp. 28-53.) 

In order to express a biologically active VETRP, the nucleotide sequences encoding VETRP 
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the mserted coding 

35 sequence in a suitable host These elements include regulatory sequences, such as enhancers, 

30 
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constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in 
polynucleotide sequences encoding VETRP. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding VETRP. Such signals include the ATG initiation codon and adjacent sequences, 
5 e.g. the Kozak sequence. In cases where sequences encoding VETRP and its initiation codon and 
upstream regulatory sequences are inserted into the appropriate expression vector, no additional 
transcriptional or translational control signals may be needed. However, in cases where only coding 
sequence, or a fragment thereof, is inserted, exogenous translational control signals inchiding an in- 
frame ATG mitiation codon should be provided by the vector. Exogenous translational elements and 
10 initiationcodonsmaybeofvariousorigins.bothnaturalandsynthetic. Theefficiency of expression 
may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used. 
(See. e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding VETRP and appropriate transcriptional and translational 
15 control elements. These metiiods include in vitro recombinant DNA techniques, synthetic techniques, 
and invivo genetic recombination. (See. e.g., Sambrook, J. et al. (1989) Molecular Cloning, A 
T.ahoratorv Manual. Cold Spring Harbor Press, Plainview NY, ch. 4. 8. and 16-17; Ausubel. F.M. et 
al. (1995) riirmnt Protocol^: in Molecular Biology. John Wiley & Sons. New YorkNY, ch. 9, 13. and 
16.) 

20 A variety of expression vector/host systems may be utilized to contain and express sequences 

encoding VETRP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed, with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g.. baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, 
25 or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, sues; Ausubel, supra; Van Heeke. G. and S J^. Schuster 
(1989) J. Biol Chem. 264:5503-5509; Bitter, GA. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer, CA. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Nad. 
Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, 

30 N. (1987) EMBO J. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO L 3: 1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. CeU Differ. 17:85-105; The 
McGraw Hill Yearbook of Science and Technology (1992) McGraw HiU, New York NY. pp. 
191-196; Logan. J. and T. Shenk (1984) Proc. Nafl. Acad. Sci. USA 81:3655-3659; and Harrington, 
J J. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses, 

35 adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
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deUvery of nucleotide sequences to the tar^ted organ, tissue, or ceU population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Then 5(6):350-356; Yu, M. et al. (1993) Proc. Nad. Acad. Sci. 
USA 9(K13):6340-6344; BuUer. R.M. et al. (1985) Nature 317(6040):813-815; McGregor, DP. et al. 
(1994) Mol. Immunol. 31(3):219-226; and Verma, IM. and N. Somia (1997) Nature 389:239-242.) 
5 The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding VETRP. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding VETRP can be achieved using a 
raultifimctional E.coK vector such as PBLUESCRIPT (Stratagene. La JoUa CA) or PSPORTl 
10 plasmid (Life Technologies). Ligation of sequences encoding VETRP into the vector's multiple 
cloning site disrupts the lacZ gpne, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g.. Van Heeke, G. and S J^. Schuster (1989) J. Biol. 
15 Chem. 264:5503-5509.) When large quantities of VETRP are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of VETRP may be used. For example, vectors 
containing the strong, inducible T5 or TV bacteriophage promoter may be used. 

Yeast expression systems may be used for production of VETRP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
20 promoter-, miy """^ y""^ .SarrhamTTivces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intraceUular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra: Bitter, supra : and Scorer, supra.) 

Plant systems may also be used for expression of VETRP. Transcription of sequences 
25 encoding VETRP may be driven viral promoters. e.g.. the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-31 1). Alternatively, plant promoters such as the smaU subuuit of RUBISCO or heat shock 
promoters may be used. (See. e.g., Comzzi. suEra; BrogUe. supra; and Winter, mm.) These 
constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 
30 transfection. (See. e.g.. The McGraw HiU Ye^rhc^k of Science and Technology (1992) McGraw 
Hill, New York NY. pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding VETRP may be Ugated into 
an adenoviras transcription/translation complex consisting of the late promoter and tripartite leader 
35 sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtairi 
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infective viras which expresses VETRP in host cells, (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci, USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarconia 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 
5 Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constracted arid delivered via conventional delivery methods (liposomes, polycationic ammo 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet 
15:345-355.) 

10 For long term production of recombinant proteins in mammalian systems, stable expression 

of VETRP in cell lines is preferred. For example, sequences encoding VETRP can be transformed 
into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days m 

15 enriched media before being switched to selective media. The purpose of the selectable marker is to 
confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 

20 include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 

phosphoribosyltransferase genes, for use in tk' and apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as tiie basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the anrinoglycosides neomycin and G-418; and als and pat 

25 confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Nad. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Nafl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 

30 (GFP; Clontech), B glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e.g., Rhodes, CA. (1995) Metiiods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 

35 also present, the presence and expression of the gene may need to be confirmed. For example, if the 
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sequence encoding VETRP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding VETRP can be identified by the absence of naarker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding VETRP under the control of a 
single promoter. Expression of the marker gene in response to induction or selection usually 

5 indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding VETRP and that 
express VETRP may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PGR 
anq>lification, and protein bioassay or immunoassay techniques which include membrane, solution, or 

10 chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring die expression of VETRP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELKAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 

15 monoclonal antibodies reactive to two non-interfering epitopes on VETRP is preferred, but a 

competitive binding assay may be employed. These and other assays are well known in the art. (See, 
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratorv Manual. APS Press, St. Paul MN, 
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and 
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana 

20 Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PGR probes for detecting sequences related to polynucleotides encoding VETRP 
mclude oligolabeling, nick translation, end-labeling, or PGR amplification using a labeled nucleotide. 

25 Alternatively, the sequences encoding VETRP, or any ftagments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 

30 (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, cheraihuninescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding VETRP may be cultured under 
conditions suitable for the expression and recovery of the protein firom cell culture. The protein 

35 produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
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and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode VETRP may be designed to contain signal sequences which 
direct secretion of VETRP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 

5 inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide inchide, but are not linrited to, acetylation, carboxylation, glycosylation. 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 

10 post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WB8) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding VETRP may be Ugated to a heterologous sequence resulting in translation of a 

15 fusion protein in any of the aforementioned host systems. For example, a chimeric VETRP protem 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of VETRP activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutatiiione S-transferase (GST), 

20 maltose binding protein QVIBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their 
cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, cabnodulin, and 
n»tal-chelate resins, respectively. FLAG, c-/wyc, and hemagglutinin (HA) enable immunoaffinity 
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 

25 that specifically recognize tiiese epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between tiie VETRP encoding sequence and die heterologous protein 
sequence, so that VETRP may be cleaved away from tiie heterologous moiety following purification. 
Methods for fusion protein expression and purification are discussed in Ausubel (1995, su^ ch. 10), 
A variety of commercially available kits may also be used to facilitate expression and purification of 

30 fusion proteins. 

In a fiirther embodiment of the invention, syntiiesis of radiolabeled VETRP may be achieved 
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with Uie 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
35 precursor, for example, ^*S-methionine. 
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VETRP of the present invention or fragments fliereof may be used to screen for compounds 
that specifically bind to VETRP. At least one and up to a plurality of test compounds may be 
screened for specific binding to VETRP. Examples of test compounds inchide antibodies, 
oligonucleotides, proteins (e.g., receptors), or small molecules. 
5 In one embodiment, the compound thus identified is closely related to the natural ligand of 

VETRP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See. 6.g., Coligan. JE. et al. (1991) Oinnnt Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which VETRP 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
10 compound can be rationaUy designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express VETRP. either as a secreted 
protein or on the cell membrane. Preferred cells inchade cells from mammals, yeast, DrosophUa, or 
E.coU . Cells expressing VEITIP or ceU membrane fractions which contain VETRP are then 
contacted with a test compound and binding, stimulation, or inhibition of activity of either VETRP or 
IS the compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fhiorophore, radioisotope!, enzyme conjugate, or oflier detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with VETRP, either in 
solution or affixed to a solid support, and detectmg the binding of VETRP to the compound. 
20 Alternatively, the assay may detect or measure binding of a test compound in the presence of a 

labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
Ubraries. or natural product mixtures, and the test compound(s) may be free in solution or afBxed to a 
solid support. 

VETRP of the present invention or fragments thereof may be used to screen for compounds 
25 that modulate the activity of VETRP. Such compounds may include agonists, antagonists, or partial 
or inverse agonists. In one embodiment, an assay is performed under conditions permissive for 
VETRP activity, wherein VETRP is combined with at least one test compound, and the activity of 
VETRP in die presence of a test compound is compared with the activity of VETRP in the absence of 
the test compound. A change in the activity of VETRP in the presence of the test compound is 
30 indicative of a compound fliat modulates the activity of VETRP- Alternatively, a test compound is 
combined with an in vitro or ceU-free system comprising VETRP under conditions suitable for 
VETRP activity, and the assay is performed. In either of these assays, a test compound which 
modulates the activity of VETRP may do so indiiecdy and need not come in direct contact with the 
test compound. At least one and up to a plurality of test compounds may be screened. 
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In another embodiment, polynucleotides encoding VETRP or their mammaUan homologs 
may be 'Tmocked out" in an animal model system using homologous recombination in embryonic 
stem (ES) cells. Such techniques are well known in the art and are useful for the generation of aninoal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
5 example, mouse ES cells, such as the mouse 129/SvJ ceU line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrapted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, 
M.R. (1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
10 the Cre-loxP system to knockout a gene of mterest in a tissue- or developmental stage-specific 
manner (Marth, J JD. (1996) Clin. Invest. 97:1999-2002; Wagner, KU. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 
blastocysts such as those from the C57BL/6 mouse stram. The blastocysts are surgicaUy transferred 
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
15 heterozygous or homozygous strams. Transgenic aninaals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding VETRP may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
Uneages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
20 into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J .A. et al. 
(1998) Science 282:1145-1147). 

Polynucleotides encoding VETRP can also be used to create *Taiockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding VETRP is injected into animal ES cells, and the injected 
25 sequence integrates into the animal cell genome. Transformed cells are injected into blastulae, and 
the blastulae are unplanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress VETRP, e,g., by secreting VETRP in its milk, may 
also serve as a convenient source of that protein (Janne, L et al. (1998) Biotechnol. Annu. Rev. 4:55- 
30 74). 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between re^ons of VETRP and vesicle traffickmg proteins. In addition, the expression of VETRP is 
closely associated with reproductive tissue, nervous tissue, cancer and inflammation/trauma. 
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Therefore, VETRP appears to play a role in vesicle trafficking disorders, autoinmmnermflammatory 
disorders, and cancer. In the treatment of disorders associated with increased VETRP expression or 
activity, it is desirable to decrease the expression or activity of VETRP. In the treatment of disorders 
associated with decreased VETRP expression or activity, it is desirable to increase the expression or 

5 activity of VETRP. 

Therefore, in one embodiment, VETRP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of VETRP. Examples of such disorders include, but are not limited to, a vesicle trafficking 
disorder, such as cystic fibrosis, glucose-galactose malabsorption syndrome, hypercholesterolemia, 

10 diabetes mellitus, diabetes insipidus, hyper- and hypoglycemia. Grave's disease, goiter, Cushing's 
disease, and Addison's disease; gastrointestinal disorders including ulcerative colitis, gastric and 
duodenal ulcers; other conditions associated with abnormal vesicle trafficking, includmg acquired 
immunodeficiency syndrome (AIDS); allergies including hay fever, asthma, and urticaria (hives); 
autoimmune hemolytic anemia; proliferative glomerulonephritis; inflammatory bowel disease; 

15 multiple sclerosis; myasthenia gravis; rheumatoid and osteoarthritis; scleroderma; Chediak-Higashi 
and Sjogren's syndromes; systemic lupus erythematosus; tojcic shock syndrome; traumatic tissue 
damage; and viral, bacterial, fungal, hehninthic, and protozoal infections; an 
autoimmune/inflammatory disorder, such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 

20 anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimtmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes 
mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' 

25 disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 

30 extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 
trauma; and a cancer, such as, adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, 
breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, 
pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 
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uterus. 

In another embodiment, a vector capable of expressing VETRP or a ftagment or derivative 
thereof may be administeted to a subject to treat or prevent a disorder associated with decreased 
expression or activity of VETRP including, but not limited to, those described above. 
5 . In a farther embodiment, a composition comprising a substantially purified VETRP in 

conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of VETRP including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of VETRP may be 

10 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of VETRP inchidmg, but not limited to, those listed above. 

In a further embodiment, an antagonist of VETRP may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of VETRP. Examples of such 
disorders mclude, but are not limited to, those vesicle trafficking disorders, 

15 autoimmune/inflammatory disorders, and cancer described above. In one aspect, an antibody which 
specifically bmds VETRP may be used directly as an antagonist or indnectly as a targeting or 
delivery mechanism for bringing a pharmaceutical agent to cells or tissues which express VETRP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding VETRP may be administered to a subject to treat or prevent a disorder associated with 

20 increased expression or activity of VETRP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered m combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 

25 combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of VETRP may be produced using methods which are generally known in the 
art. In particular, purified VETRP may be used to produce antibodies or to screen libraries of 

30 pharmaceutical agents to identify those which specifically bind VETRP. Antibodies to VETRP may 
also be generated using methods that are weU known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

35 For the production of antibodies, various hosts including goats, rabbits, rats, coice, humans, 
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and others may be iiranuBized by injection with VETRP or with any fragment or oUgopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 

5 polyols, polyanions, peptides, oil emulsions. KLH, and dinitrophenol. Among adjuvants used in 
humans, BCG (bacilli Cabnette-<juerin) and Corvnebacterium parvum are especially preferable. 

It is preferred fliat the oligopeptides, peptides, or fragments used to induce antibodies to 
VETRP have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 

10 fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches 
of VETRP amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to VETRP may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 

15 limited to, the hybridoma technique, tiie human B-cell hybridoma technique, and the EB V-hybridoma 
technique, (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:3 M2; Cote, RJ. et al. (1983) Proc. Nati. Acad. Sci. USA 80:2026-2030; and 
Cole, SP. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for tiie production of "chimeric antibodies," such as the 

20 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, SX.etal. (1984) Proc 
Nad. Acad. Sci. USA 81:6851-6855; Neuberger, M.S, et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known ui the art, to produce VETRP-specific single 

25 chain antibodies. Antibodies witii related specificity, but of distinct idiotypic composition, may be 
generated by chaui shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 
Burton, DJt. (1991) Proc. Nati. Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 

30 disclosed in tiie literature. (See, e.g., Qrlandi, R. et al. (1989) Proc. Nati. Acad, Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific bmding sites for VETRP may also be generated. 
For exam|)le, such fragments include, but are not limited to, F(ab% fragments produced by pepsin 
digestion of tiie antibody molecule and Fab fragments generated by reducing the disulfide bridges of 

35 tiie F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
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easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive bmding or immunoradiometric assays using either 

5 polyclonal or monoclonal antibodies with established specificities are well known in the art Such 
mununoassays typically involve the measurement of complex formation between VETRP and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering VETRP epitopes is generally used, but a competitive binding assay 
may also be employed (Pound, supm) . 

10 Various methods such as Scatchard analysis in conjunction vnth radioimmunoassay 

techniques may be used to assess the affinity of antibodies for VETRP. Affinity is expressed as an 
association constant, K^, which is defined as the molar concentration of VETRP-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equiUbrium conditions. 
The determined for a preparation of polyclonal antibodies, which are heterogeneous in their 

15 affinities for multiple VETRP epitopes, represents the average aflSnity, or avidity, of the antibodies 
for VETRP. The determined for a preparation of monoclonal antibodies, which are monospecific 
for a particular VETRP epitope, represents a true measure of affinity, ffigh-affinity antibody 
preparations with ranging from about 10^ to 10^^ L/mole are preferred for use in unmunoassays in 
which the VETRP-antibody complex must withstand rigorous manipulations. Low-affinity antibody 

20 preparations with ranging from about 10* to 10^ Umole are preferred for use in 

immunopurification and similar procedures which ultimately require dissociation of VETRP, 
preferably in active form, from the antibody (Catty. D. (1988) Antibodies, Volume I: A Practical 
A pproach, IRL Press, Washington DC; LiddeU, JE. and A. Cryer (1991) A Practical Guide to 
Monoclonal Antibodies, John Wiley & Sons, New York NY). 

25 The titer and avidity of polyclonal antibody preparations may be further evaluated to 

determine the quality and suitability of such preparations for certain downstream applications. For 
exaiiq)le, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of VETRP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 

30 guidelines for antibody quality and usage in various applications, are generally available. (See, e,g., 
Catty, supra, and Coligan et al., supra .) 

In another embodiment of the invention, the polynucleotides encoding VETRP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, modifications 
of gene expression can be achieved by designing complementary sequences or antisense molecules 

35 (DNA, RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene 
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encoding VETRP. Such technology is well known in the art, and antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding VETRP. (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics. Humana Press Xnc, 
TotawaNJ.) 

5 In therapeutic use, any gene delivery system suitable for introduction of the antisense 

sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intraceUularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g.. 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, KJ. et al. (1995) 

10 9(13):1288-1296.) Antisense sequences can also be mtroduced intraceUularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g.. Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms mclude liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 

15 al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
* 25(14):273a.2736.) 

In another embodiment of tfie invention, polynucleotides encoding VETRP may be used for 
somatic or germline gene ther^y . Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in die cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X- 

20 linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated witii an inherited adenosine deaminase (ADA) deficiency 
(Blaese, RM. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thakssamias, famiUal 

25 hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor K deficiencies (Crystal, 
R.G. (1995) Science 270:404410; Verma, IM. andN. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracelhilar parasites 
(e.g., against human retroviruses, such as human immunodeficiency vims (HIV) (Baltimore, D. 

30 (1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. NaU, Acad. Sci. USA. 93: 1 1395-1 1399), 
hepatitis B or C vims (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis : and protozoan parasites such as Plasmodium falcipamm and Trypanosoma cruzi). In the 
case where a genetic deficiency in VETRP expression or regulation causes disease, the expression of 
VETRP from an appropriate population of transduced cells may alleviate the clinical manifestations 

35 caused by the genetic deficiency. 
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In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
VETRP are treated by constructing mammaUan expression vectors encoding VETRP and introducing 
these vectors by mechahical means into VETRP-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
5 ballistic gold particle delivery, (iii) Uposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, RA. and WP. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivies. Z. (1997) Cell 91:501-510; Boulay. J-L. and H. R6cipon (1998) Curr. 
Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of VETRP include, but are not 
10 limited to. die PCDNA 3.1, EPUAG. PR(XMV2. PREP. PVAX vectors (Invitrogen. Carlsbad GA). 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF. 
FTET-ON. PrRE2, FrRE2-LUC. PTK-HYG (Clontech. Palo Alto CA). VETRP may be expressed 
using (i) a constitutively active promoter. (e.g.. from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV). SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (e.g., the 
15 tetracyclme-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Nad. Acad. Sci. USA 

89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, FM.V. and HM. Blau (1998) 
Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, FM.V. 
20 and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding VETRP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECnON HT, avaUable from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target ceUs in culture and requite minimal effort to optimize experimental 
25 parameters. In the alternative, transformation is performed using the calcium phosphate method . 
(Graham, FL. and A.J. Eb (1973) Virology 52:456^7), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
diese standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
30 respect to VETRP expression are treated by constracting a retrovirus vector consisting of (i) the 

polynucleotide encoding VETRP under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iu) a Rev-responsive 
element (RRE) along with additional retrovirus c«-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
35 commerciaUy available (Stratagene) and are based on pubUshed data (Riviere, L et al. (1995) Ptoc. 
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Natl. Acad. Sci. USA 92:6733-6737), incoiporated by reference herein. The vector is propagated in 
an appropriate vector producing ceU line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender. MA et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey. R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 
discloses a method for obtaining retrovirus packaging ceU lines and is hereby incoiporated by 
reference. Propagation of retrovirus vectors, transduction of a population of ceUs (e.g., CD4* T- 
10 cells), and the return of transduced ceUs to a patient are procedures well known to persons skilled in 
the art of gpne therapy and have been weU documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi. Mi. (1997) J. Virol. 71:4707-4716; 
Ranga. U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su. L. (1997) Blood 89:2283- • 
2290). 

15 In the alternative, an adenovirus-based gene therapy deUvery system is used to deUver 

polynucleotides encoding VETRP to ceUs which have one or more genetic abnormaUties with respect 
to the expression of VETRP. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding imraunoregulatory proteins into intact islets in the pancreas 

20 (Csete. MB. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5.707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, PA et al. (1999) 
Annu. Rev. Nutr. 19:511-544; and Verma. UA. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

25 In another alternative, a herpes-based, gene therapy deUvery system is used to deliver 

polynucleotides encoding VETRP to target cells which have one or more genetic abnormaUties with 
respect to the expression of VETRP. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducmg VETRP to cells of the central nervous system, for which HSV has 
a tropism. The construction and packaging of herpes-based vectors are well known to those with 

30 ordinary skiU in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has 
been used to deUver a reporter gene to the eyes of primates (Uu, X. et al. (1999) Exp. Eye 
Res.l69:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby mcorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant 
35 HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a 
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cell under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 
and ICP22. For HSV vectors, see also Coins, WP. et al. (1999) J. Virol. 73:519-532 and Xu, H, et al. 
(1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned 
5 herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 
ordinary skill in the art 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
10 deliver polynucleotides encoding VETRP to target ceUs. The biology of the prototypic alphavirus, 
Semliki Forest Vims (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
protems. This subgenomic RNA replicates to higher levels than the fuU-length genomic RNA, 
15 resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting Ae coding sequence for VETRP into the 
alphavirus genome in place of the capsid-coding region results in the production of a large number of 
VETRP-<;oding RNAs and the synthesis of high levels of VETRP in vector transduced cells. While 
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 
20 persistent infection in hamster normal kidney ceUs (BHK-21) with a variant of Sindbis vims (SIN) 
mdicates that the lytic replication of alphavirases can be altered to suit the needs of the gene therapy 
appUcation (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphavirases will 
allow the introduction of VETRP into a variety of ceH types. The specific transduction of a subset of 
cells in a population may require the sorting of ceUs prior to transduction. The methods of 
25 manipulating infectious cDNA clones of alphavirases, performing alphavirus cDNA and RNA 

transf ections, and performing alphaviras infections, are well known to those with ordinaiy skill in the 
art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, 

30 inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature. (See, e.g.. Gee, JJE. et al. (1994) in Huber, B£. 
and B.L Carr, Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163- 

35 177.) Aconq)lementary sequence or antisense molecule may also be designed to block translation of 
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mRNA by preventing the transcript from binding to ribosonies. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, f oUowed by endonucleolytic cleavage. For example, 
5 engineered hammerfiead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding VETRP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
10 corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary stractural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
15 by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Altematively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding VETRP. Such DNA sequences may be incorporated into a wide variety of 
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA 
20 constructs that syntiiesize complementary RNA, constitutively or inducibly, can be introduced into 
cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase 
• 25 linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of tiiese molecules by the inclusion of nontraditional bases such as inosine, 
queosme, and wybutosine, as weU as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous 
endonucleases. 

30 An additional embodiment of the invention encompasses a metiiod for screening for a 

compound which is effective in altering expression of a polynucleotide encoding VETRP. 
Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
35 macromolecular chemical entities which are capable of interacting with specific polynucleotide 
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sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 
promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
VETRP expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding VETRP may be therapeutically useful, and in tiie treament of disorders 
associated with decreased VETRP expression or activity, a compound which specificaUy promotes 
expression of the polynucleotide encoding VETRP may be therapeutically useful. 

At least one. and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any metiiod 
commonly known in die ait, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-avaaable or proprietary 
library of naturaUy-occurring or non-natural chemical conqwnnds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
Ubrary of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding VETRP is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabiUzed ceU, or an inviao cell-free or reconstituted 
biochemical system. Alterations in tiie expression of a polynucleotide encoding VETRP are assayed 
by any method commonly known in tiie art. TypicaUy, the expression of a specific nucleotide is 
detected by hybridization wiA a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding VETRP. The amount of hybridization may be quantified, thus 
forming flie basis for a comparison of the expression of tiie polynucleotide botii with and witiiout 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
tiie polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a vSchizosaccharomyces pombe gene expression 
system (Atidns, D. et al. (1999) U.S. Patent No. 5,932,435; Amdt. GM. et al. (2000) Nucleic Acids 
Res. 28:E15) or a human cell line such as HeLa ceU (Clarke, MX. et al. (2000) Biochem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of flie present invention involves screening a 
combinatorial Ubrary of oUgonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic adds, and modified oUgonucleotides) for antisense activity against a specific polynucleotide 
sequence (Braice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

Many metiiods for introducing vectors into cells or tissues are available and equaUy suitable 
for use in vivo, invitit). and ex vivo . For ex vivo tiierapy, vectors may be intiroduced into stem ceUs 
taken from tiie patient and clonally propagated for autologous transplant back into that same patient. 
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DeUveiy by transfection, by Uposome injections, or by polycationic amino polymers may be achieved 
using methods which are weU known in the art (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
5 such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition 
which ^aally comprises an active ingredient fonnukted with a pharmaceutically acceptable 
exdpitmL Excipients may include, for exanq)le, sugars, starches, celluloses, gums, and proteins. 
10 Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmaceurical Sciences (Maack Publishing, Easton PA). Such compositions may 
consist of VETRP, antibodies to VETRP, and mimetics, agonists, antagonists, or inhibitors of 
VETRP. 

The coiiq>ositions utilized in this invention may be administered by any number of routes 
15 including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, puhnonary, transdermal, subcutaneous, intraperitoneal, intranasal; 
enteral, topical, sublingual, or rectal means. 

Compositions for puhnonary administration may be prepared in liquid or dry powder form. 
These conqxjsitions are generally aerosolized immediately prior to inhalation by the patient In Ae 
20 case of smaU molecules (e.g. traditional low molecular weight organic drugs), aerosol dehvery of 
fast-acting formulations is weU-known in the art. In the case of macromolecules (e.g. larger peptides 
and proteins), recent developments in the field of puhnonary deUvery via the alveolar region of the 
lung have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, 
J.S. et al., U.S. Patent No. 5,997,848). Puhnonary deUvery has the advantage of administration 
25 without needle mjection, and obviates the need for potentially toxic penetration enhancers. 

Coii^>ositions suitable for use in the invention mclude conditions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
30 macromolecules comprising VETRP or fragments thereof. For example, liposome preparations 
containing a cell-inpermeable macromolecule may promote cell fusion and intracellular deUvery of 
the macromolecule. Alternatively, VETRP or a fragment thereof may be joined to a short cationic N- 
terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of all tissues, includmg the brain, in a mouse model system (Schwarze, S.R. et 
35 al. (1999) Science 285: 1569-1572). 
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For any compound, the therapeutically effective dose can be estimated initiaUy either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, 
monkeys, or pigs. An animal model may also be used to determine die appropriate concentration 
range and route of administration. Such information can then be used to determine useful doses and 
5 routes for administration in humans. 

A ther^utically effective dose refers to that amount of active mgredient, for example 
VETRP or fragments thereof, antibodies of VETRP, and agonists, antagonists or inhibitors of 
VETRP, which ameliorates the synq)toms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such 
10 as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the 
dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is flie 
therapeutic index, which can be expressed as the LD^^^ ratio. Compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
15 preferably within a range of circulating concentrations that includes die ED50 witii Uttie or no toxicity. 
The dosage varies within tiiis range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
20 active moiety or to maintain tiie desired effect. Factors which may be taken into account include die 
severity of the disease state, die general healdi of die subject, the age, weight, and gender of the 
subject, time and frequency of adnainistration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, 
or biweekly depending on die half-life and clearance rate of the particular formulation. 
25 Normal dosage amounts may vary from about 0.1 peg to 100,000 ^g, up to a total dose of 

about 1 gram, depending upon die route of administration. Guidance as to particular dosages and 
methods of delivery is provided m die literature and generally available to practitioners in the art 
Those skilled in die art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, deUvery of polynucleotides or polypeptides will be specific to particular ceUs, 
30 conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specificaUy bind VETRP may be used for die 
diagnosis of disorders characterized by expression of VETRP, or in assays to monitor patients being 
treated widi VETRP or agonists, antagonists, or mhibitors of VETRP. Antibodies useful for 
35 diagnostic purposes may be prepared in die same manner as described above for therapeutics. 
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Diagnostic assays for VETRP include methods which utilizB the antibody and a label to detect 
VETRP in human body fluids or in extracts of ceUs or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in 

5 the art and may be used. 

A variety of protocols for measuring VETRP, including ELBAs, RIAs, and FACS, are 
known in tije art and provide abasis for diagnosing altered or abnormal levels of VETRP expression. 
Normal or standard vahies for VETRP expression are established by combining body fluids or ceU 
extracts taten from normal mammalian subjects, for example, human subjects, with antibody to 

10 VETRP undCT conditions suitable for con^lex formation. The amount of standard complex 

formation may be quantitated by various metiiods, such as photometric means. Quantities of VETRP 
expressed in subject, control, and disease san^les from biopsied tissues are compared widi die 
standard values. Deviation between standard and subject values estabUshes die parameters for 
diagnosing disease. 

15 In another embodiment of die invention, the polynucleotides encoding VETRP may be used 

for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of VETRP may be correlated 
widi disease. The diagnostic assay may be used to determine absence, presence, and excess 

20 expression of VETRP, and to monitor regulation of VETRP levels during tiierapeutic intervention. 

Ja one aspect, hybridization with PGR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding VETRP or closely related molecules may be used 
to identify nucleic acid sequences which encode VETRP. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 

25 conserved motif, and die sttingency of the hybridization or amplification will determine whedier die 
probe identifies only natarally occurring sequences encoding VETRP, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of die VETRP encoding sequences. The hybridization probes of the subject 
30 invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:2446 or from 
genomic sequences inchiding promoters, enhancers, and introns of the VETRP gene. 

Means for producing specific hybridization probes for DNAs encoding VETRP inchide tiie 
cloning of polynucleotide sequences encoding VETRP or VETRP derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
35 be used to synthesize RNA probes in vitro by means of the addition of die appropriate RNA 
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polymerases and the appropriate labeled nucleotides. Hybridization probes nay be labeled by a 
variety of reporter groups, for example, by radionucUdes such as ^ or ^^S. or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin^iotin coupling systems, and the bke. 
Polynucleotide sequences encoding VETRP may be used for the diagnosis of disorders 

associated with expression of VETRP. Examples of such di«,rders mclude. but are not limited to. a 
vesicle trafficking disorder, such as cystic fibrosis, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, diabetes mellitus. diabetes insipidus, hyper- and hypoglycemia. Grave's 
disease, goiter. Cnshing's disease, and Addison' s disease; gastroint^tinal disorders including 
ulcerative colitis, gastric and duodenal ulcers; other conditions associated with abnormal vesicle 
mafficking, including acquired immunodeficiency syndrome (AIDS); aUergies includmg hay fever, 
asthma, and urticaria (hives); autoimmmie hemolytic anemia; proliferative glomeralonephritis; 
farflarnmatory bowel disease; multiple sclerosis; myasthenia gravis; rhemnatoid and osteoarthritis; 
scleroderma; Chediak-Higashi and Sjogren's syndromes; systemic tapus erythematosus; toxic shock • 
syndrome; traumatic tissue damage; and viral, bacterial, fungal, hehmnthic, and protozoal infections; 
an autoimmune/inflammatory disorder, such as acquired immunodeficiency syndrome (AIDS). 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondyUtis. amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED). bronchitis, 
cholecystitis, contact dermatitis. Crohn's disease, atopic dermatitis, dermatomyositis. diabetes 
meUitus, emphysema, episodic lymphopenia with lymphocytotoxins. erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis. Goodpasture's syndrome, gout. Gmves' 
disease. Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
polymyositis, psoriasis. Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis. Werner syndrome. compUcations of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and hehninthic infections, and 
trauma- and a cancer, such as. adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
teratocarcinoma. and. in particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain. 
► breast, cervix, gall bladder, ganglia, gastromtestinal tract, heart, kidney, liver, tang, muscle, ovary, 
pancreas, parathyroid, penis, prostate. saUvary glands, skin, spleen, testis, thymus, thyroid, and 
uteras . The polynucleotide sequences encoding VETRP may be used in Southern or northern 
analysis, dot blot, or other membrane-based technologies; in PGR technologies; in dipstick, pin. and 
niultifonnat EUSA-like assays; and in microarrays utilizing fluids or tissues fl:om patients to detect 
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altered VETRP expression. Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding VETRP may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding VETRP may be labeled by standard methods and added to a fluid or tissue 
5 samplefromapatientunderconditionssuitablefortheformationofhybridizationcomplexes. Aftera 

suitable incubation period, the sample is wad»ed and the signal is quantified and compared with a 
standard vatae. If the amount of signal in the patient sample is significantly altered in comparison to 
a control sample flien the presence of altered levels of nucleotide sequences encoding VETRP in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 
10 the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
VETRP, a normal or standard profile for expression is established. This may be accomplished by 
combinmg body fluids or cell extracts taken from normal subjects, either animal or human, with a 
15 sequence, or a fragment thereof . encoding VETRP. under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the vahies obtained ftom 
normal subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with vahies 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
20 values is used to establish the presence of a disorder. 

Once the presence of a disorder is estabUshed and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begms to approximate that which is observed in the normal subject The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
25 days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
30 to employ preventative measures or aggressive treatment earlier thereby preventing the development 

or farther progression of the cancer. 

Additional diagnostic uses for oUgonucleotides designed from the sequences encoding 
VETRP may involve die use of PGR. These oUgomers may be chemically synthesized, generated 
enzymatically, or produced mviSo. Oligpmers will preferably contain a fragment of a polynucleotide 
35 encoding VETRP, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
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VETRP, and wiU be employed mder optimized conditions for identification of a specific gene or 
condition. OUgomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oUgonucleotide primers derived from tiie polynucleotide sequences 
encoding VETRP may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic 
disease in humans. Methods of SNP detection include, but are not limited to, single-stranded 
conformation polymorphism (SSCP) and fluorescent SSCP (fSSC3') meUiods. In SSCP, 
oUgonucleotide primers derived fiom die polynucleotide sequences encoding VETRP are used to 
» amplifyDNAusingd»epolymerasechainreaction(PCR). Hie DNA may be derived, for example, 
from diseased or normal tissue, biopsy samples, bodily fluids, and flie like. SNPs in tiie DNA cause 
differences in the secondary and tertiary structures of PGR products in single-stianded form, and 
these differences are detectable using gel electrophoresis in non-denaturing gels. In fSCCP, tiie 
oUgonucleotide primers are fluorescently labeled, which aUovi^s detection of the ampUmers in high- 
15 throughput equipment such as DNA sequencing machines. AdditionaUy, sequence database analysis 
methods, termed in siUco SNP (isSNP). are capable of identifying polymorphisms by comparing the 
. sequence of individual overlapping DNA fragments which assemble into a common consensus 
sequence. These computer-based methods filter out sequence variations due to laboratory preparation 
of DNA and sequencing errors using statistical models and automated analyses of DNA sequence 
20 chromatograms. Jn the alternative, SNPs may be detected and characterized by mass spectromedy 
using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify tiie expression of VETRP include radiolabeUng 
or biotinylating nucleotides. coampUfication of a control nucleic acid, and interpolating results from 
standard curves. (See. e.g.. Melby. P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa. C. 
25 et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by nmning the assay in a high-throughput format where the oUgomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives 
rapid quantitation. 

In fiirtiier embodiments, oUgonucleotides or longer fragments derived from any of the 
30 polynucleotidesequencesdescribedhereinmaybeusedaselementsonamicroarray. The microarray 
can be used in transcript imaging techniques which monitor tiie relative expression levels of large 
numbers of gpnes simultaneously as described in SeUhamer, J J. et al., "Comparative C3ene Transcript 
Analysis," U.S. Patent No. 5,840.484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
35 determine gene fionction. to understand the genetic basis of a disorder, to diagnose a disorder, to 
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monitor progiession/regression of disease as a function of gene expression, and to develop and 
monitor the activities of therapeutic agents in the treatment of disease. In particular, this information 
may bensed to developapharmacogpnomic profile ofapatient in order to select the most appropriate 

and effective treatment regimen for that patient. For example, tiierapeutic agents which are highly 
5 effective and display the fewest side effects may be selected for a patient based on his/her 
phannacogenomic profile. 

In another embodiment, antibodies specific for VETRP. or VETOP or fragments thereof may 
be used as elements on a microarray. The microairay may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 
,0 A particular embodiment relates to the use of die polynucleotides of the present invention to 

generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantising the nnmber of expressed genes and their relative abmidance mider given conditions and at 
a given time. (See Seilhamer et al.. •'Comparative CJene Transcript Analysis." U.S. Patent Number 
15 5,840.484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their con^lements to the totaUty of 
transcriptsorreversetranscriptsofaparticulartissueorcelltype. In one embodiment, the 

hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements compriseasubsetofapluraUty of elements onamicroarray. The 

20 resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated fromtissues. ceU lines, 
biopsies, or other biological samples. The transcript image may thus reflect gene expression jn^dyo. 
as in the case of a tissue or biopsy sample, or in^3tio. as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present 
25 invention may also be used in conjunction wifli invifis niodel systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and namrally^urring environmental 
compounds. All compounds induce characteristic gene expression patterns, fiequenfly termed 
molecular fingerprints or toxicant signamres. which are indicative of mechanisms of action and 
toxicity (Nuwaysir, EK et al. (1999) Mol. Careinog. 24:153-159; Steiner, S. and N.L. Anderson 
30 (2000) Toxicol. Lett. 112-113:467-471, expressly incorporated by reference herein). If atest 
compound hasasignature similar to that ofacompound with known toxicity,it is likely to share 

those toxic properties. These fingerprints or signatures are most useful and refined when they contain 
expression information from a large number of genes and gene famiUes. Ideally, a genome-wide 
measurement of expression provides the hi^est quaUty signature. Even genes whose expression is 
35 not altered by any tested compounds are important as well, as tiie levels of expression of these genes 
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are used to nonnaUze the rest of the expression data. The normalization procedure is useful for 
conq,arison of expression data after treatment with different compounds. WhDe the assignment of 
gene function to elements of a toxicant signature aids in interpretation of toxicity mechanisms, 
knowledge of gene function is not necessary for the statistical matching of signatures which leads to 
5 prediction of toxicity. (See, for example. Press Release 00-02 from the National bstitote of 
Environmental Health Sciences, released February 29, 2000, available at 
http://wwwauehs.nih.gov/ocMews/toxchip.htm.) Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to inchide all expressed gene sequences. 

In one embodiment, the toxicity of atest compound is assessed by treatingabiological 

10 samplecontainingnucleicacidswiththetestcompound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
the present mvention, so that transcript levels corresponding to the polynucleotides of the piesent 
invention may be quantified. Tlie transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
15 are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to die use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or ceU type. The term proteome refers to the global 
pattemofproteinexpressioninaparticulartissueorcelltype. Each protein component of a 
proteome can be subjected individually to further analysis. Proteome expression patterns, or profiles. 
20 are analyzed by quantifymg the number of expressed proteins and their relative abundance under 
given conditions and at a given time. A profile of a cell' s proteome may thus be generated by 
separating and analyzmg th^polypeptides of a particular tissue or cell type. In one embodiment, the 
separation is achieved using two-dimensional gel electrophoresis, in which proteins from a sample are 
separated by isoelectric focusing m the first dimension, and then according to molecular weight by 
25 sodimn dodecyl sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, 
sopra). The proteins are visualized in the gel as discrete and uniquely positioned spots, typically by 
staining the gel with an agent such as Coomassie Blue or sUver or fluorescent stains. The optical 
density of each protein spot is generaUy proportional to the level of the protein in the sample. The 
optical densities of equivalently positioned protein spots ftom different samples, for example, from 
30 biological samples either treated or untreated with a test compomid or therapeutic agent, are 

compared to identify any changes in protein spot density related to the treatment The proteins in the 
spots are partially sequenced using, for example, standard methods employing chemical or enzymatic 
cleavage foUowed by mass spectrometry. The identity of the protein inaspot may be determmed by 

comparing its partial sequence, preferably of at least 5 contiguous ammo acid residues, to the 
35 polypeptidesequencesofthepresentinvention. In some cases, further sequence data may be 
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obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for VETRP to quantify 
the levels of VETRP expression. In one embodiment, the antibodies are used as elements on a 
microarray, and protein expression levels are quantified by exposing the microarray to the sample and 
5 detecting the levels of protein bound to each array element (Lueking. A. et al. (1999) Anal. Biochem. 
270:103-111; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed 
by a variety of methods known b the art, for example, by reacting the proteins in the sample with a 
thiol- or amino-reactive fluorescent compound and detecting the amount of fluorescence boraid at 
each array element 

10 Toxicant signatures at the proteome level are also useful for toxicological screening, and 

should be analyzed m parallel with toxicant signatures at the transcript level. There is a poor 
correlation between transcript and protein abmidances for some proteins in some tissues (Anderson, 
Ni. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 
usefal in the analysis of compounds which do not significanUy affect the transcript image, but which 

15 altertheproteomicprofile. In addition, the analysis of transcripts in body fluids is difficult, due to 
rapid degradation of mRNA. so proteomic profiling may be more reliable and informative in such 
cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins diat are expressed in the treated 

20 biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological 
sample. A difference in the amount of protein between the two samples is indicative of a toxic 
response to the test compound in the treated sample. Individual proteins are identified by sequencing 
the amino acid residues of the individual proteins and comparing these partial sequences to the 

25 polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing FOteins with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biological 

30 sampleiscompaiedwiththeamomitinanuntreatedbiologicalsample. A difference in the amount of 
protein between the two samples is indicative of a toxic response to the test compound m die treats 
sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Breraian. TM. et al. (1995) U.S. Patent No. 5.474,796; Schena. M. et al. (19%) Proc. Natl. Acad. Sci. 
35 USA 93:10614-10619; BaldeschweUer et al. (1995) PCT appUcation W095/251116; Shalon. D. et al. 
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(1995) per appUcation WO95/35505; Heller. R.A. et al. (1997) Ptoc. Natl. Acad. Sci. USA 94:2150- 
2155; and HeUer, MJ. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microanays are 
well known and thoroughly described in DNA Microaiiavs: A Practical A pproach, M. Schena, ed. 
(1999) Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding VETRP may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences may be used, and in some instances, noncoding sequences may 
be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesircd cross hybridization during 
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial 
chromosomes (HACs), yeast artificial chromosomes QTACs), bacterial artificial chromosomes 
(BACs), bacterial PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J J. 
et al. (1997) Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. 
(1991) Trends Genet. 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be 
used to develop genetic Unkage maps, for example, which correlate the inheritance of a disease state 
with the inheritance of a particular chromosome region or restriction fragment length polymorphism 
(RFLP). (See. e.g.. Lander, E.S. and D. Botstdn (1986) Proc. Nati. Acad. Sci. USA 83:7353-7357.) 

Huorescent in situ hybridization (FISH) may be correlated with other physical and gpnetic 
map data. (See, e.g., Heinz-Ulrich. et al. (1995) in Meyers. suEm, PP- %5-968.) Examples of genetic 
map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding VETRP on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using estabUshed chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of anoflier mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease gpnes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to llq22-23, 
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, RA. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, cander, or affected individuals. 
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m another embodiment of the invention. VEraP. its catalytic or immunogenic fragments, or 
oUgoi^ptides thereof canbe used for screeninglibraries of compounds in any ofav^^ 
screeningtechniques. The fragment employedin such screeningmay be free^ 
soUd support, borne on a ceU surface, or located intraceUularly. m form^^^ 

5 betweenVETRPandtheagentbeingtestedmaybemeasuted. 

Another technicp^e for drug screeningprovides for high throughput screening of ^^^^ 

having suitable binding affmity to the protein of interest. (See. e.g.. CSeysen^ 
appUcationWO84/03564.) Inthismethod.largenumbersof different smaU test compounds are 
^thesi^onasolid substrate. Tl»e test compounds arereacted with VEITJP.orfra^^ 
10 andwashed. Bound VFTRP is then detected by methods weU known in the art. Purified VETRP can 
alsobecoateddirectfyontoplatesforuseintheaforementioneddrugscreeningtechni^^^^ 
Altematively.non-neutralizing antibodies can beused tocapturethepeptide^^ 

solid support , . 

m another embodiment, one may use competitive drug «:reening assays in whxch neutrabzmg 
15 ant.1>odiescapableofbindingVETT«>specificaUycompetewithatestcompoundforb 
VETRP. I„thismanner.antibodiescanbeusedtodetectthep«senceofanypeptide 

one or more antigenic determinants with VETRP. 

madditional embodiments, thenucleotide sequences whichencodeVETRPmaybeusedm 

any molecular biology techniques that haveyettobedeveloped, provided d.enew techniques rely on 
20 properties of nucleotide sequences that are currenUy known, including, but not limited to. such 

properties as the triplet genetic code and specific base pair interactions. 

WiAoutfurAer elaboration, it is beUeved that one skiUed in the art can, using thepreceto^ 

descripdon.utilizethepresentinventiontoitsfimestextentmfoHowingpreferreds^^^ 

embodiments are, therefore, tobeconstruedas merely inustrative,andnotn^^^^^^ 

25 of the disclosure in any way whatsoever. 

The disclosures of aIlpatents.appUcations,andpubUcations mentioned above andbelow.m 

particular U.S. Ser. No. 60/172.968 and U.S. Ser. No. 60/172.066 are hereby expressly incorporated 

by reference. 
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EXAMPLES 

Construction of cDNA Libraries 

RNAwaspurchasedfromClontechorisolatedfromtissuesdescribedinTable4. Some 
homogenized and lysed in guanidiniumisothiocyanate. while others were homogemzed 

rd"^^inphenolorinasuitablemixtureofdenat«rants.suchas™2DL(LifeT^^^^^^ 

35 n^nophasicsolutionofphenolandguanidineisothiocyanate. Ibe resulting lysates were centrrfuged 



tissues were 1 
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over Csa cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
isopropanol or sodium acetate and ethanol. or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity In some cases. RNA was treated with DNase. For most Ubraries. poly(A+) RNA was isolated 
nsing oligo d(DK:oupled paramagnetic particles (Promega), OUGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOIEX mRNA purification kit (QIAGEN). Alternatively. RNA was 
isolated directly from tissue lysates using other RNA isolation kits. e.g.. the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases. Stratagene was provided with RNA and constructed the corresponding cDNA 
Ubraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNEAP 
vector system (Stratagene) or SUPERSCRffT plasmid system (Life Technologies), using the 
recommended procedures or similarmethods known in the art (See. e.g., Ausubel, 1997. sues, units 
51^6) Reverse transcription was initiatedusingoUgod(T)orrandom primers. Synthetic 
oUgonncleotide adapters were ligated to double stranded cDNA. and the cDNA was digested with the 
appn,priate «>striction enzyme or enzymes. For most libraries, the cDNA was size-selected (300- 
1000 bp) using SEPHACRYL SIOOO, SEPHAROSE CL2B. or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
, were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g.. 
PBLUESCRIFr plasmid (Stratagene), PSPORTl plasmid (Life Technologies), pcDNA2.1 plasmid 
(fcvitrogen. Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant 
plasmids were transformed into competent RcoU ceUs including XLl-Blne. XLl-BlueMRF, or 
SOLR from Stratagene or DH5o, DHIOB, or ElectroMAX DHIOB from Life Technologies. 

BL Isolation of cDNA Clones 

Plasmids obtamed as described in Example I were recovered from host cells by rnvwo 
excision using the UNEAP v^tor system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Mhriprep purification kit (Edge Biosystems. Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R£ Ai. PREP 96 
plasmid purification kit from QIAGEN. Following precipitation, plasmids were lesuspended in 0.1 

) ml of distiUed water and stored, with or without lyophilization, at 4''C. 

Alternatively, plasmid DNA was an^lified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored m 
384-weU plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 

5 nsingPICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKANH fluorescence 
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scanner (Labsystems Oy, Helsinki, Finland), 
m. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example H were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 

5 such as the ABI CATALYST 800 (Applied Biosystems) diermal cycler or the PTC-200 thermal 
cycler (MS Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) Uquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or suppUed in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminalor cycle sequencing ready reaction kit (Applied Biosystems). 

10 Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); fee 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or odier sequence analysis systems known in the art. Reading 
frames wifliin the cDNA sequences were identified using standard methods (reviewed in Ausubel, 

15 1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 

disclosed in Example VL 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed , 
using a combination of software programs which utilize algorithms weU known to those skilled in the 
• art. Table 5 summarizes the tools, programs, and algorithms used and provides aj^hcable 

20 descriptions, references, and threshold parameters. The first cohmm of Table 5 shows the tools, 
programs, and algorithms used, the second column provides brief descriptions tiiereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probabiUty values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher die score, the 

25 greater die homology between two sequences). Sequences were analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using die default 
paiameteis specified by the clustal algorithm as incoipotated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 

30 sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were tiien queried 
against a selection of public databases such as the GenBank primate, rodent, mammaUan, vertebrate, 
35 and eukaryote databases, and BLOCKS, PRINTS, DOMO. PRODOM, and PFAM to acquire 
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annotation using prognuns based on BLAST. FASTA. and BLIMPS. The sequences were assembled 
into fiiU length polynucleotide sequences using programs based on Phred, Phrap. and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The ftdl length polynucleotide sequences were translated to derive the corresponding foU length 

5 amino acid sequences, and these full length sequences were subsequentiy analyzed by querying 
against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO, PRODOM. Prosite, and ffidden Markov Model (HMM)-based protein family databases such 
as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See. e.g., Eddy, S.R. (1996) Cnrr. Opin. Stract. Biol. 6:361-365.) 

10 The programs described above for the assembly and analysis of full length polynucleotide 

and amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:24-46. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies were described in The Invention section above. 
IV. Analysis of Polynucleotide Expression 

15 Northern analysis is a laboratory technique used to detect the presence of a transcript of a 

gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular ceU type or tissue have been bound. (See. e.g., Sambrook, supra, ch. 7; Ausubel, 

1995, si^a. ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
20 molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is. 
much faster than multiple membrane-based hybridizations. In addition, die sensitivity of the 
computer search can be modified to determine whether any particular match is categorized as exact or 
similar. The basis of tiie search is the product score, which is defined as: 

BLAST Score x Percent I dentity 
25 5 X minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and flie 
length of the sequence match. The product score is a normali2ed value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 

30 product is divided by (5 times the lengfli of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(FKP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP. tiien the pair with the highest BLAST score is used to calculate 
tiie product score. The product score represents a balance between fiactional overlap and quaUty in a 

35 BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
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entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 

identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of Ubraries in which 
the transcript encoding VETRP occuned. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, demiatologic, 
developmental, endocrine, gastromtestinal, hematopoieticfimmune, musculoskeletal, nervous, 
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of Ubraries across aU categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
Tables. 

V. Chromosomal Mapping of ABBR Encoding Polynucleotides 

nie cDNA sequences which were used to assemble SEQ ID NO:24-46 were compared with 
sequences from the Incyte UFESEQ database and pubUc domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:24^ were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 
from pubUc resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that chister, includuig its particular SEQ ID NO:, to that map location. 

nie genetic map locations of SEQ ID N0:31. SEQIDNO:36. SEQIDNO:38, SEQ ID 
NO:42, and SEQ ID NO:44 are described in The Invention as ranges, or intervals, of human 
chromosomes. More than one map location is reported for SEQ ID NO:31, SEQ ID NO:36, and SEQ 
ID NO:44, indicating that previously mapped sequences having similarity, but not conq)lete identity, 
to SEQ ID NO:31, SEQ ID NO:36, and SEQ ID NO:44 were assembled into their respective chisters. 
The map position of an interval, in centiMorgans. is measured relative to tiie terminus of the 
chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal maricers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by G6n6thon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of die clusters. Human genome 
maps and other resources available to die public, such as the NCBI "GeneMap-99" World Wide Web 
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site (htf p //www.ncbi nlm nih f^ov/generoap/) . can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VI. Extension of VETRP Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:24^ were produced by extension of 
5 an appropriate fragment of the full length molecule using oUgonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
primer, to initiate 3' extension of the known fragment. The initial primers were designed using 
OUGO 4.06 software (National BiosciencesX or anodier appropriate program, to be about 22 to 30 
nucleotides in lengfli, to have a GC content of about 50% or more, and to anneal to the target 
10 sequence at temperatures of about 68'C to about 72°C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fideUty anq>lification was obtained by PGR using methods weU known in the art. PGR 
15 was performed in 96-weU plates using the PTG-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg**, (m^O^, 
and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfii DNA polymerase (Stiatagene), with the foUowing parameters for primer 
pair pa A and PO B: Step 1: 94^, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
20 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68''C. 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as foUows: Step 1: 94°C, 3 mm; Step 2: 
94<'C, 15 sec; Step 3: STC, 1 min; Step 4: 68^, 2 mm; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: eS'C, 5 min; Step 7: storage at 4''C. 

The concentration of DNA in each well was determined by dispensing 100 nl PIGCX3REEN 
25 quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 05 \i\ of undiluted PGR product into each weU of an opaque fluorimeter plate (Coming Gostar, 
Acton MA), aUowing the DNA to bind to the reagent. The plate was scanned in a Huoroskan H 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 ^1 to lO^l aliquot of the reaction mixture was analyzed by 
30 electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending 
Ihe sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested vnth CvLfl cholera virus endonuclease (Molecular Biology Research, Madison W^, and 
sonicated or sheared prior to religation into pUG 18 vector (Amersham Pharmacia Biotech). For 
35 shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
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agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfa DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent Rcoli cells. Transformed ceUs were selected on 
5 antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in 
384-well plates in LB/2x carb liquid media. 

The ceUs were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfii DNA polymerase (Stratagene) with the foBowing 
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: eO'C, 1 min; Step 4: 72°C. 2 min; 
10 Step 5: steps 2. 3, and 4 repeated 29 tunes; Step 6: 72°C 5 min; Step 7: storage at 4°C. DNA was 
quantified by HCOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reampMed using the same conditions as described above. Samples were diluted 
with 20% dimethysulfoxide (1:2. v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
15 BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, the polynucleotide sequences of SEQ ID NO:24-46 are used to obtain 5' 
regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
Vn. Labeling and Use of Individual Hybridization Probes 
20 Hybridization probes derived from SEQ ID NO:24-46 are employed to screen cDNAs, 

genomic DNAs, or mRNAs. Although flie labeling of oUgonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. OUgonucleotides are designed using state-of-the-art software such as OUGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oUgomer, 250 i^Ci of 
25 ly-^] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oUgonucleotides are substantially purified using a 
SBPHADEX G-25 superfine size exchision dextran bead cohmm (Amersham Pharmacia Biotech). 
An aUquot containing 10^ counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: 
30 AseI.BglII,EcoRI,PstI,XbaI,orPvun(DuPontNEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nyttan Plus. Schleicher & SchueU, Durham NH). Hybridization is carried out for 16 
hours at 40''C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example. 0.1 x saUne sodium citrate and 0.5% sodium dodecyl sulfate. 
35 Hybridization patterns are visuaUzed using autoradiography or an alternative imaging means and 
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compared. 

Vin. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra), 

5 mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 

aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999), 
supra) . Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 
Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate usmg thermal, UV, chemical, or mechanical bonding 

10 procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat. Biptechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 

15 contprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 

20 fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
con^lementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed. Li one embodiment, microarray preparation and usage is 
described in detail below. 

25 Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A)* RNA is purified using die oligo-(dT) cellulose metiiod. Each poly(A)* RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/pl oIigo-(dT) primer (21mer), IX 
first strand buffer, 0.03 units/|jl RNase inhibitor, 500 \M dATP, 500 iM dGTP, 500 dTTP, 40 

30 mM dCTP, 40 fiM dCTP-Cy3 (BDS) or dCrrP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 nnJ volume containing 200 ng poly (A) ^ RNA with 
GEMBRIGHT kits (Incyte). Specific control poly(A)'^ RNAs are synAesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 ^^C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 

35 incubated for 20 minutes at 85 °C to die stop the reaction and degrade the RNA. Samples are purified 
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using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are etiianol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% etbanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 

5 resuspended in 14 pi 5X SSaO.2% SDS. 
Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element 
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PGR amplification 
uses primers complementary to the vector sequences flanking tiie cDNA insert. Array elements are 

10 amplified in thirty cycles, of PGR fix)m an initial quantity of 1-2 ng to a final quantity greater dian 5 
^ig. Amplified array elements are then purified using SEPHACRYL^ (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
sUdes (Coming are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
15 washes between and after toreatinents. Glass sUdes are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, 
and coated witih 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated sUdes are cured in a 
110°Coven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
20 Patent No. 5,807,522, incorporated herein by reference. 1 pi of the array element DNA, at an average 
concentration of 100 ng/\A, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
25 Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 pi of san^le mixture consisting of 0.2 jig each of Cy3 and 
30 Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The san^le 
mixture is heated to 65 "C for 5 minutes and is aliquoted onto fee microarray surface and covered 
with an 1 .8 cm^ coverslip. Hie arrays are tiansferred to a waterproof chamber having a cavity just 
slightiy larger than a microscope slide. The chamber is kept at 100% humidity internally by the 
addition of 140 pi of 5X SSC in a comer of the chamber. The chamber containing the arrays is 
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incubated for about 6.5 hours at 60 ''C. The arrays are washed for 10 rain at 45 in a first wash 
buffer (IX SSC, 0.1% SDS), three times for 10 minutes each at 45 in a second wash buffer (O.IX 
SSC), and dried. 
Detection 

5 Reporter-labeled hybridization complexes are detected with a microscope equipped with an 

Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 mn for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 

10 scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 

15 filters positioned between die array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 
The sensitivity of the scans is typically calibrated using the signal intensity generated by a 

20 cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1 :100,000. When two samples 
from different sources (e.g., representmg test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are 

25 differentially expressed, the calibration is done by labeling samples of the calibrating cDNA with the 
two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 

30 linear 20-color transformation to a pseudocolor scale ranging from blue Oow signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overtyping 
emission spectra) between the fluorophores using each fluorophore*s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each 
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spot is centered in each element of the grid. The fluorescence signal within each element is then 
integrated to obtain a numerical value conesponding to die average intensity of die signal. The 
software used for signal analysis is die GEMTOOLS gene expression analysis program ancyte). 

IX. Complementary Polynucleotides 

5 Sequences complementary to die VETRP-encoding sequences, or any parts Uiereof, are used 

to detect, decrease, or inhibit expression of naturally occurring VETRP. Aldiough use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially die same 
procedure is used witii smaller or widi larger sequence fragments. Appropriate oligonucleotides are 
designed using OUGO 4.06 software (National Biosciences) and d»e coding sequence of VETRP. To 

10 inhibit transcription, a complementary oligonucleotide is designed from die most unique 5' sequence 
and used to prevent promoter binding to die coding sequence. To mhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to die VETRP-encoding 
transcript. 

X. Expression of VETRP 

15 Expression and purification of VETRP is achieved using bacterial or virus-based expression 

systems. For expression of VETRP in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter tiiat directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, die trp-lac {tac) hybrid 
promoter and die T5 or T7 bacteriophage promoter in conjunction witii die lac operator regidatory 

20 element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express VETRP upon induction widi isoiaopyl beta-D- 
diiogalactopyranoside (IPTG). Expression of VETRP in eukaryotic cells is achieved by infecting 
insect or mammalian ceU lines witii recombinant A.itnf>raphica califomica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 

25 replaced widi cDNA encoding VETRP by eidier homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and die strong 
polyhedrin promoter drives hi^ levels of cDNA transcription. Recombinant baculovirus is used to 
infect SpnHo pteraftneiperda (Sf9) insect cells in most cases, or human hepatocytes. in some cases, 
hifection of die latter requires additional genetic modifications to baculoviras. (See Engelhard, E.K. 

30 et al. (1994) Ptoc. Nad. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

hi most expression systems, VETRP is syndiesized as a fusion protein widi, e.g., ^utadiione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
afiinity-based purification of recombinant ftision protein from crude cell lysates. GST, a 26- 
35 kilodalton en2yme from .Schistosoma iaoonicum. enables die purification of fusion proteins on 
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immobilized glutatbione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). FoUowing purification, the GST moiety can be proteolyticaUy cleaved from 
VETRP at specificaUy engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 
purification using commerciaUy available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
5 Kodak). 6-ffis, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra, ch. 10 and 16). Purified VETRP obtained by these methods can be used directly in the assays 
shown in Exan^jles XI and XV. 
XI. Demonstration of VETRP Activity 
10 VETRP activity is measured by its inclusion in coated vesicles. VETRP can be expressed by 

transforming a mammalian cell line such as COS7, HeLa. or CHO with an eukaryotic expression 
vector encoding VETRP. Eukaryotic expression vectors are commercially available, and the 
techniques to introduce them into cells are well known to those skilled in the art A smaU amount of 
a second plasmid, which expresses any one of a number of marker genes, such as p-galactosidase. is 
15 co-transformed into the ceUs in order to allow rapid identification of those ceUs which have taken up 
and expressed the foreign DNA. The cells are incubated for 48-72 hours after transformation under 
conditions appropriate for the cell line to allow expression and accumulation of VETRP and p- 
galactosidase. 

Transformed ceDs are collected and ceU lysates are assayed for vesicle formation. A non- 
20 hydiDlyzable form of OTP, GTPyS, and an ATP regenerating system are added to the lysate and the 
mixture is incubated at 37 °C for 10 minutes. Under these conditions, over 90% of the vesicles 
remain coated (Orci, L. et al (1989) CeU 56:357-368). Transport vesicles are salt-released from the 
Golgi membranes, loaded under a sucrose gradient, centrifuged, and fractions are coUected and 
analyzed by SDS-PAGE. Co-localization of VETRP with clathrin or COP coatamer is indicative of 
25 VETRP activity in vesicle formation. The contribution of VETRP in vesicle formation can be 
confirmed by incubating lysates with antibodies specific for VEIRP prior to GTPyS addition. The 
antibody will bind to VETRP and interfere with its activity, thus preventing vesicle formation. 

In the alternative, VETRP activity is measured by its ability to alter vesicle trafficking 
pathways. Vesicle trafficking in cells transformed with VETRP is examined using fluorescence 
30 microscopy. Antibodies specific for vesicle coat proteins or typical vesicle trafficking substrates such 
as transferrin or the mannose-6-phosphate receptor are commerciaUy avaUable. Various ceUular 
components such as ER, Golgi bodies, peroxisomes, endosomes, lysosomes, and the plasmalemma are 
examined. Alterations in the numbers and locations of vesicles in ceUs transformed with VETRP as 
conq)aR5d to control cells are characteristic of VETRP activity. 

35 
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Xn. Functional Assays 

VETRP ftinction is assessed by expressing the sequences encoding VETRP at physiologicaUy 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
5 include pCMV SPORT plasnrid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10 /zg of recombinant vector are transiently transfected into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
fotmulatirais or electroporation. 1-2 jug of an additional plasmid containing sequences encoding a 
mariffir protein are co-transfected. Expression of a marker protein provides a means to distinguish 
10 transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g.. Green Fhiorescent Protein (GFP; 
Qontech), CD64, or a CD64-GFP fusion protein. How cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
15 fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 
changes in ceU size and granularity as measured by forward Ught scatter and 90 degree side Ught 
scatter, down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular proteins as measured by reactivity with 
20 specific antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein-conjugated Annexin V protein to the ceU surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994) Flow Cvtometrv. Oxford, New York NY. 

The influence of VETRP on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding VETRP and either CD64 or CD64-GFP. 
25 CD64 and CT>64-GFP are expressed on the surface of transfected cells and bind to conserved re^ons 
of human immunoglobulin G GgG). Transfected cells are efBcienUy separated from nontransfected 
ceDs using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in 
the art. Expression of mRNA encoding VETRP and other genes of interest can be analyzed by 
30 northern analysis or microarray techniques. 

Xffl. Production of VETRP Specific Antibodies 

VETRP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 
35 Alternatively, the VETRP amino acid sequence is analyzed using LASERGENE software 
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(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and nsed to raise antibodies by means known to those of skill in the art Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) 

5 Typically, oligopeptides of about 15 residues in lengdi are synthesized using an ABI 43 1 A 

peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St. Louis MO) by reaction with N-maleiraidobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity. (See, e.g., Ausubel, 1995, supra ,) Rabbits are immunized with the 
oUgopeptide-KLH complex in complete Freund's adjuvant Resulting antisera are tested for 

10 antipeptide and anti-VETRP activity by, for example, bindmg the peptide or VETRP to a substrate, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat 
anti-rabbit IgG. 

XIV. Purification of Naturally Occurring VETRP Using Specific Antibodies 
Naturally occurring or recombinant VETRP is substantially purified by immunoaffinity 

15 chromatography using antibodies specific for VETRP. An immunoaffinity column is constructed by 
covalently coupling anti-VETRP antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSB (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing VETRP are passed over the immimoaffinity column, and the column is 

20 washed under conditions that allow the preferential absorbance of VETRP (e.g., higji ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibodyAHBTRP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and VETRP is collected. 

XV. Identification of Molecules Which Interact with VETRP 

25 VETRP, or biologically active fragments thereof, are labeled with '^I Bolton-Hunter reagent 

(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem, J. 133:529-539.) Candidate molecules 
previously arrayed m the wells of a multi-well plate are incubated with the labeled VETRP, washed, 
and any wells with labeled VETRP complex are assayed. Data obtained using different 
concentrations of VETRP arc used to calculate values for the number, affinity, and association of 

30 VETTRP with the candidate molecules. 

Alternatively, molecules interacting with VETRP are analyzed using the yeast two-hybrid 
system as described m Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on die two-hybrid system, such as tiie MATCHMAKER system (Clontech). 

VETRP may also be used in tiie PATHCALLING process (CuraGen Corp., New Haven CT) 

35 which employs die yeast two-hybrid system in a high-througjiput manner to determine all interactions 
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between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 

Various modifications and variations of the described naethods and systems of the invention 
5 will be apparent to those skilled in the art without departing from the scope and spirit of the 

invention. Although the invention has been described in connection with certain embodiments, it 
should be understood that the invention as claimed should not be unduly limited to such specific 
embodiments. Indeed, various modifications of the described modes for carrying out the invention 
which are obvious to those skilled in molecular biology or related fields are mtended to be within the 
10 scope of the following claims. 
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(Fraction of Total) 
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What is claimed is: 

1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO:l-23, 

b) anaturaUy occurring anrinoadd sequence having atleast90% sequence identity to an 

amino acid sequence selected from the group consisting of SEQ ID NO:l-23, 

c) a biologically active fiagment of an amino acid sequence selected from the group 

consisting of SEQ ID NO:l-23, and 
10 d) animmunogenicfragmentofanaminoaddsequenceselectedfromthegroupconsisting 

of SEQ ID NO: 1-23. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

23. 

15 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:24-46. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 

7. A ceU transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method for producing a polypeptide of claim 1, the method comprising: 

a) ciilturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polymicleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of 

claim 1, and 
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b) recovering the polyjpepdde so expressed. 

10. An isolated antft)ody which specfficaUybm^ 

11. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:24^. 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:24^. 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 11. 

13 . A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 11, the method comprising: 

a) hybridizing the sample wifli a probe comprising at least 20 contiguous nucleotides 
conq,rising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

14. A method of claim 13. wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) amplifymg said target polynucleotide or fragment thereof using polymerase chain reaction 

amplification, and 

b) detecting die presence or absence of said amplified target polynucleotide or fragment 
thaeof, and. optionaUy. if present, the amount hereof. 
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16. Acompositioncomprisinganeffectiveamountofapolypeptide of claim 1 anda 
pharmaceutically acceptable excipient. 

17. A composition of claim 16. wherein the polypeptide comprises an amino acid sequence 
5 selected from die group consisting of SEQ ID NO: 1-23. 

18 A method for treating a disease or condition associated with decreased expression of 
functional WIKP.comprisingadministeringtoapatient in need of suc^ 

of claim 16, 

19. Amethodforscreeningacompoundforeffectivenessasanagonistof apolypeptideof 

claim 1. the method comprising: 

a) exposing a sample comprising apolypeptide of claim 1 to acompound. and 

b) detecting agonist activity in the sample. 

20. AcompositioncomprisinganagonistcompoundidentifiedbyamethodofclaimWand 
a pharmaceutically acceptable excipient. 

21 Amethodfortreatingadiseaseorconditionassociatedwididecreasedexpressionof 
20 functionalVEI^.comprismgadministeringtoapatientinneedofsuchtreatmentaco^^^^^^ 

claim 20. 

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, die mefliod comprising: 
25 a) exposingasamplecomprisingapolypeptideofclaimltoacompound,and 

b) detecting antagonist activity in the sample. 

23. Acompositioncomprisinganantagonistcompoundidentifiedbyamethodofclaim22 
and a pharmaceutically acceptable excipient. 



30 



24. A method for treating a disease or condition associated with overexpression of functional 
VETRP, comprising administering to a patient in need of such treatment a composition of claim 23. 

25 . A method of screening for a compound that specifically binds to the polypeptide of claim 
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1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 

conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying 
5 a compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of die polypeptide of 

claim 1, said method comprismg: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 

10 permissive for the activity of die polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

c) comparing the activity of the polypeptide of claim 1 in die presence of die test compound 
with die activity of die polypeptide of claim 1 in the absence of die test compound, wherein achange 

15 in die activity of the polypeptide of claim 1 in the presence of die test compound is indicative of a 
compound diat modulates die activity of die polypeptide of claun 1. 

27. A mediod for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said targpt polynucleotide comprises a sequence of claim 5, die method 

20 con^rising: 

a) exposing a sample comprising die target polynucleotide to a compound, under conditions 
suitable for die expression of die target polynucleotide, 

b) detecting altered expression of die target polynucleotide, and 

c) comparing die expression of die target polynucleotide in die presence of varying amounts 
25 of die compound and in the absence of the compound. 

28. A metiiod for assessing toxicity of a test compound, said mediod comprising: 

a) treating a biological sample containing nucleic acids widi die test compound; 

b) hybridizing die nucleic acids of die treated biological sample widi a proha comprising at 
30 least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 

hybridization complex is formed between said probe and a target polynucleotide in die biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
11 or fragment diereof; 

c) quantiifymg die amount of hybridization complex; and 
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d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 

compound. 
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<110> XNCYTE GENOMICS, INC. 
TANG, Y. Tom 
YUE, Henry 
BANDMAN, Olga 
HIIjLMAN, Jennifer 
BAUGHN, Mariah 
LU, Dyung Aina M. 
AZIMZAI, Yalda 

<120> VESICLE TRAFFICKING PROTEINS 
<130> PF-0756 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/172,968; 60/172,066 
<151> 1999-12-21; -1999-12-23 

<160> 46 

<170> PERL Program 



<210> 1 
<211> 481 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc„feature 

<223> Incyte ID No: 381039CD1 



Me?°Vai Met Ala Ala Lys Lys Gly Pro Gly Pro Gly Gly Gly Val 
- 5 10 J-^ 

Ser Gly Gly Lys Ala Glu Ala Glu Ala Ala Ser Glu Val Trp cys 
20 25 -^u 

Arg Arg Val Arg Glu Leu Gly Gly Cys Ser Gin Ala Gly Asn Arg 
35 40 '^^ 

His cys Phe Glu Cys Ala Gin Arg Gly Val Thr Tyr Val Asp lie 

Thr Val Gly Ser Phe Val Cys Thr Thr cys Ser Gly Leu Leu Arg 

Gly Leu Asn Pro P^o His Arg Val Lys Ser He Ser Met Thr Thr 

80 85 
Phe Thr Glu Pro Glu Val Val Phe Leu Gin Ser Arg Gly Asn Glu 
95 100 •'■"3 

val cys Arg Lys He Trp Leu Gly Leu Phe Asp Ala Arg Thr Ser 

110 115 J-^" 

Leu val Pro Asp Ser Arg Asp Pro Gin Lys Val Lys Glu Phe Leu 

125 1*^" 
Gin Glu Lys Tyr Glu Lys Lys Arg Trp Tyr Val Pro Pro Asp Gin 

140 145 
Val Lys Gly Pro Thr Tyr Thr Lys Gly Ser Ala ser Thr Pro Val 

Gin Gly ser He Pro Glu Gly Lys Pro Leu Arg Thr Leu Leu Gly 

170 175 1^" 

Asp Pro Ala Pro Ser Leu Ser Val Ala Ala Ser Thr Ser Ser Gin 

185 190 ±1^ 

Pro Val ser Gin Ser His Ala Arg Thr Ser Gin Ala Arg Ser Thr 

200 205 
Gin Pro Pro Pro His Ser Ser Val Lys Lys Ala Ser Thr Asp Leu 

215 220 225 

Leu Ala Asp He Gly Gly Asp Pro Phe Ala Ala Pro Gin Met Ala 

230 235 -^^^ 

Pro Ala Phe Ala Ala Phe Pro Ala Phe Gly Gly Gin Thr Pro Ser 

245 250 
Gin Gly Gly Phe Ala Asn Phe Asp Ala Phe Ser Ser Gly Pro Ser 
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260 265 270 

Ser Ser Val Phe Gly Ser Leu Pro Pro Ala Gly Gin Ala Ser Phe 

275 "^^^ il 

Gin Ala Gin Pro Thr Pro Ala Gly Ser Ser Gin Gly Thr Pro Phe 

290 29b 
Gly Ala Thr Pro Leu Ala Pro Ala Ser Gin Pro Asn Ser Leu Ala 

305 ■^■^^ T 

ASP val Gly ser Phe Leu Gly Pro Gly Val Pro Ala Ala Gly Val 

pro ser Ser Leu %l Gly Met Ala Gly Gin Val Pro Pro Leu Gin 



ser val Thr Met III Gly Gly Gly Gly s|i Ser Thr Gly Leu Ala 

350 

Phe Gly Ala Phe Thr Asn Pro Phe Thr Ala Pro Ala Ala Gin Ser 

365 -^'^ 
pro Leu Pro Ser Thr Asn Pro Phe Gin Pro Asn Gly Leu Ala Pro 

380 J o3 

Gly pro Gly Phe Gly Met Ser Ser Ala Gly Pro Gly Phe Pro Gin 

Ala val pro Pro ^ Gly Ala Phe Ala Ser Ser Phe Pro Ala Pro 

410 ^-^-^ 
Leu Phe pro Pro Gin Thr Pro Leu Val Gin Gin Gin Asn Gly Ser 

425 

ser Phe Gly Asp Leu Gly Ser Ala Lys Leu Gly Gin Arg Pro Leu 

AAO 44-3 

ser Gin Pro Ala Gly He Ser Thr Asn Pro Phe Met Thr Gly Pro 

455 Z-i 
ser ser Ser Pro Phe Ala Ser Lys Pro Pro Thr Thr Asn Pro Phe 

470 

Leu 



<210> 2 

<211> 195 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc„feature 

<223> Incyte ID No: 383249CD1 

MeS'seJ ser Cys Ser Asn Val Cys Gly Ser Arg Gin Ala Gin Ala 

Ala Ala Glu Gly Gly Tyr Gin Arg Tyr Gly Val Arg Ser Tyr Leu 
20 

His Gin Phe Tyr Glu Asp Cys Thr Ala Ser He Trp Glu Tyr Glu 



ASP ASP Phe Gin ill Gin Arg Ser Pro Asn Arg Trp Ser Ser Val 

Phe Trp Lys Val GlJ Leu He Ser Gly Thr Val Phe Val He Leu 

65 

Gly Leu Thr Val Leu Ala Val Gly Phe Leu Val Pro Pro Lys He 

80 

Glu Ala Phe Gly Glu Ala Asp Phe Val Val Val Asp Thr His Ala 

95 100 
val Gin Phe Asn Ser Ala Leu Asp Met Tyr Lys Leu Ala Gly Ala 

110 l-'^^ 
val Leu Phe Cys He Gly Gly Thr Ser Met Ala Gly Cys Leu Leu 

125 

Met ser Val Phe Val Lys Ser Tyr Ser Lys Glu Glu Lys Phe Leu 

140 

Gin Gin Lys Phe Lys Glu Arg He Ala Asp He Lys Ala His Thr 
155 1«0 -^"-^ 

Gin pro val Thr Lys Ala Pro Gly Pro Gly Glu Thr Lys He Pro 

170 

val Thr Leu Ser Arg Val Gin Asn Val Gin Pro Leu Leu Ala Thr 
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<210> 3 

<211> 313 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 618769CD1 

Me?°GlJ Asn Phe Asp Asn Ala Asn Val Oto Gly Glu He Glu Phe 

Ala He His Tyr Cys Phe Lys Thr His Ser Leu Glu He Cys lie 

20 25 
Lys Ala cys Lys Asn Leu Ala Tyr Gly Glu Glu Lys Lys Lys Lys 

35 40 
cys Asn Pro Tyr Val Lys Thr Tyr Leu Leu Pro Asp Arg Ser Ser 

50 55 
Gin Gly Lys Arg Lys Thr Gly Val Gin Arg Asn Thr Val Asp Pro 

Thr Phe Gin Glu Thr Leu Lys oyr Gin Val Ala Pro Ala Gin Leu 

80 85 
Val Thr Arg Gin Leu Gin Val Ser Val Trp His Leu Gly Thr Leu 
95 100 J-U^ 

Ala Arg Arg Val Phe Leu Gly Glu Val He He Pro Leu Ala Thr 

110 ii5 ^ 

Trp Asp Phe Glu Asp Ser Thr Thr Gin Ser Phe Arg Trp Hxs Pro 

125 130 
L6U Arg Ala Lys Ala Glu Bys Tyr Glu Asp Ser Val Pro Gin Ser 
140 145 J^3u 

Asn Gly Glu Leu Thr Val Arg Ala Bys Leu Val Leu Pro Ser Arg 
155 160 -i-"^ 

Pro Arg Lys Leu Gin Glu Ala Gin Glu Gly Thr Asp Gin Pro Ser 
170 175 -LOU 

Leu His Gly Gin Leu Cys Leu Val Val Leu Gly Ala Lys Asn Leu 

185 l^'' 
Pro val Arg Pro Asp Gly Thr Leu Asn Ser Phe Val Lys Gly Cys 

200 205 
Leu Thr Leu Pro Asp Gin Gin Lys Leu Arg Leu Lys Ser Pro Val 

215 220 
Leu Arg Lys Gin Ala Cys Pro Gin Trp Lys His Ser Phe Val Phe 

230 235 
Ser Gly Val Thr Pro Ala Gin Leu Arg Gin Ser Ser Leu Glu Leu 

245 250 
Thr Val Trp Asp Gin Ala Leu Phe Gly Met Asn Asp Arg Leu Leu 

260 265 
Gly Gly Thr Arg Leu Gly Ser Lys Gly Asp Thr Ala Val Gly Gly 
275 280 285 

Asp Ala cys ser Leu Ser Lys Leu Gin Trp Gin Lys Val Leu Ser 

ser pro Asn Leu Trp Thr Asp Met Thr Leu Val Leu His 
305 310 

<210> 4 

<211> 201 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 1234837CD1 

M^t^Gly Asn Leu Phe Gly Arg Lys Lys Gin Ser Arg Val Thr Glu 
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1 5 10 15 

Gin ASP Lys Ala He Leu Gin Leu Lys Gin Gin Arg Asp Lys Leu 

20 25 -^^ 

Arg Gin Tyr Gin Lys Arg He Ala Gin Gin Leu Glu Arg Glu Arg 

35 40 
Ala Leu Ala Arg Gin Leu Leu Arg Asp Gly Arg Lys Glu Arg Ala 

50 55 0" 

Lys Leu Leu Leu Lys Lys Lys Arg Tyr Gin Glu Gin Leu Leu Asp 

65 7" 
Arg Thr Glu Asn Gin He Ser Ser Leu Glu Ala Met Val Gin Ser 

80 85 
He Glu Phe Thr Gin He Glu Met Lys Val Met Glu Gly Leu Gin 

95 100 
Phe Gly Asn Glu Cys Leu Asn Lys Met His Gin Val Met Ser He 

110 115 
Glu Glu val Glu Arg He Leu Asp Glu Thr Gin Glu Ala Val Glu 
125 

Tyr Gin Arg Gin He Asp Glu Leu Leu Ala Gly Ser Phe Thr Gin 

140 145 
Glu Asp Glu Asp Ala He Leu Glu Glu Leu Ser Ala He Thr Gin 

155 loO ^ 

Glu Gin He Glu Leu Pro Glu Val Pro Ser Glu Pro Leu Pro Glu 

170 

Lys He Pro Glu Asn Val Pro Val Lys Ala Arg Pro Arg Gin Ala- 
185 

Glu Leu Val Ala Ala Ser 
200 

<210> 5 

<211> 566 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1607223CD1 

ser cys Gly Lys Arg Phe His Asp Glu Val Gly Lys Phe 
1 5 10 

Arg Phe Leu Asn Glu Leu He Lys Val Val Ser Pro Lys Tyr Leu 

20 25 
Glv ser Arg Thr Ser Glu Lys Val Lys Asn Lys He Leu Glu Leu 

35 40 
Leu Tyr ser Trp Thr Val Gly Leu Pro Glu Glu Val Lys He Ala 

50 55 
Glu Ala Tyr Gin Met Leu Lys Lys Gin Gly He Val Lys Ser Asp 

Pro Lys Leu Pro A^p Asp Thr Thr Phe Pro Leu Pro Pro Pro Arg 
80 85 t , 

Pro Lys Asn Val He Phe Glu Asp Glu Glu Lys Ser Lys Met Leu 
95 100 -Luo 

Ala Arg Leu Leu Lys Ser Ser His Pro Glu Asp Leu Arg Ala Ala 
110 115 

Asn Lys Leu He Lys Glu Met Val Gin Glu Asp Gin Lys Arg Met 

Glu Lys He Ser ^s Arg Val Asn Ala He Glu Glu Val Asn Asn 

140 145 
Asn Val Lys Leu Leu Thr Glu Met Val Met Ser His Ser Gin Gly 

155 160 
Gly Ala Ala Ala Gly Ser Ser Glu Asp Leu Met Lys Glu Leu lyr 

170 1'5 
Gin Arg Cys Glu Arg Met Arg Pro Thr Leu Phe Arg Leu Ala Ser 

185 190 = 

Asp Thr Glu Asp Asn Asp Glu Ala Leu Ala Glu He Leu Gin Ala 

200 205 ■^•"■^ 

Asn Asp Asn Leu Thr Gin Val He Asn Leu Tyr Lys Gin Leu Val 
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215 220 225 

Arg Gly Glu Glu Val Asn Gly Asp Ala Thr Ala Gly Ser lie Pro 

230 235 
Gly ser Thr Ser Ala Leu Leu Asp Leu Ser Gly Leu Asp Leu Pro 
245 250 ^^•> 

Pro Ala Gly Thr Thr Tyr Pro Ala Met Pro Thr Arg Pro Gly Glu 

260 265 
Gin Ala ser Pro Glu Gin Pro Ser Ala Ser Val Ser Leu Leu Asp 

275 280 
Asp Glu Leu Met Ser Leu Gly Leu Ser Asp Pro Thr Pro Pro Ser 
290 295 J"" 

Gly Pro ser Leu Asp Gly Thr Gly Trp Asn Ser Phe Gin Ser Ser 
305 310 -Jla 

ASP Ala Thr Glu Pro Pro Ala Pro Ala Leu Ala Gin Ala Pro Ser 

320 325 , „ t 

Met Glu Ser Arg Pro Pro Ala Gin Thr Ser Leu Pro Ala Ser Ser 
335 340 -»^^ 

Gly Leu Asp Asp Leu Asp Leu Leu Gly Lys Thr Leu Leu Gin Gin 

350 355 
ser Leu Pro Pro Glu Ser Gin Gin Val Arg Trp Glu Lys Gin Gin 
365 370 375 

Pro Thr Pro Arg Leu Thr Leu Arg Asp Leu Gin Asn Lys Ser Ser 

380 385 ^ 

Ser cys Ser Ser -Pro Ser Ser Ser Ala Thr Ser Leu Leu His Thr 

395 400 
val ser Pro Glu Pro Pro Arg Pro Pro Gin Gin Pro Val Pro Thr 

410 415 
Glu Leu ser Leu Ala Ser lie Thr Val Pro Leu Glu Ser He Lys 
425 430 43> 

Pro ser Asn He Leu Pro Val Thr Val Tyr Asp Gin His Gly Phe 

440 445 
Arg He Leu Phe His Phe Ala Arg Asp Pro Leu Pro Gly Arg Ser 

455 460 
ASP val Leu val Val Val Val Ser Met Leu Ser Thr Ala Pro Gin 

470 475 
pro He Arg Asn He Val Phe Gin Ser Ala Val Pro Lys Val Met 

485 490 
Lys val Lys Leu Gin Pro Pro Ser Gly Thr Glu Leu Pro Ala Phe 
^ 500 505 510 

Asn Pro He Val His Pro Ser .Ala He Thr Gin Val Leu Leu Leu 

53^5 520 
Ala Asn Pro Gin Lys Glu Lys Val Arg Leu Arg Tyr Lys Leu Thr 
530 535 =>^^ 

Phe Thr Met Gly Asp Gin Thr Tyr Asn Glu Met Gly Asp Val Asp 
545 550 ^-'^ 

Gin Phe Pro Pro Pro Glu Thr Trp Gly Ser Leu 
560 , 565 

<210> 6 

<211> 270 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1621554CD1 

M^t^Gly Phe Trp Cys Arg Met Ser Glu Asn Gin Glu Gin Glu Glu 

1 5 ^ 

Val He Thr Val Arg Val Gin Asp Pro Arg Val Gin Asn Glu Gly 

20 25 30 

ser Trp Asn Ser Tyr Val Asp Tyr Lys He Phe Leu His Thr Asn 

35 40 4b 

Ser Lys Ala Phe Thr Ala Lys Thr Ser Cys Val Arg Arg Arg Tyr 

50 55 
Arg Glu Phe Val Trp Leu Arg Lys Gin Leu Gin Arg Asn Ala Gly 
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65 70 75 

Leu Val Pro Val Pro Glu Leu Pro Gly Lys Ser Thr Phe Phe Gly 

80 . 
Thr ser Asp Glu Phe He Glu Lys Arg Arg Gin Gly Leu Gin Hxs 

Phe Leu Glu Lys Val Leu Gin Ser Val Val Leu Leu Ser Asp Ser 

110 ±j-^ r* 

Gin Leu His Leu Phe Leu Gin Ser Gin Leu Ser Val Pro Glu lie 

125 1 
Glu Ala cys Val Gin Gly Arg Ser Thr Met Thr Val Ser Asp Ala 

140 ■^-^^ T 

lie Leu Arg Tyr Ala Met Ser Asn Cys Gly Trp Ala Gin Glu Glu 

155 LOU 
Arg Gin ser Ser Ser His Leu Ala Lys Gly Asp Gin Pro Lys Ser 

170 

cys cys Phe Leu Pro Arg Ser Gly Arg Arg Ser Ser Pro Ser Pro 

Pro Pro ser Glu g?u Lys Asp His Leu Glu Val Trp Ala Pro Val 
200 20b ^-^^ 

val ASP ser Glu Val Pro Ser Leu Glu Ser Pro Thr Leu Pro Pro 

215 2Zi} 
Leu Ser Ser Pro Leu Cys cys Asp Phe Gly Arg Pro Lys Glu Gly 

230 23b 
a?hr ser Thr Leu Gin Ser Val Arg Arg Ala Val Gly Gly Asp Hxs 



245 250 
Ala val Pro Leu Asp Pro Gly Gin Leu Glu Thr Val Leu Glu Lys 
260 265 



<210> 7 
<211> 490 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1751553CD1 

M^^AlI Thr Glu Phe He Lys Ser Cys Cys Gly Gly Cys Phe Tyr 

Gli Glu Thr Glu Lys His Asn Phe Ser Val Glu Arg Asp Phe Lys 

20 25 
Ala Ala Val Pro Asn Ser Gin Asn Ala Thr He Ser Val Pro Pro 

35 

Leu Thr Ser Val Ser Val Lys Pro Gin Leu Gly Cys Thr Glu Asp 

50 Z 
Tyr Leu Leu Ser Lys Leu Pro Ser Asp Gly Lys Glu Val Pro Phe 

65 

val val Pro Lys Phe Lys Leu Ser Tyr lie Gin Pro Arg Thr Gin 

80 "-^ 
Glu The Pro ser His Leu Glu Glu Leu Glu Gly Ser Ala Arg Ala 

ser Phe Gly Asp a?§ Lys Val Glu Leu Ser Ser Ser Ser Gin His 
110 . 

Gly pro ser Tyr Asp Val Tyr Asn Pro Phe Tyr Met Tyr Gin His 
125 T 

lie ser Pro Asp Leu ser Arg Arg Phe Pro Pro Arg Ser Glu Val 
140 

Thr Arg Leu Tyr Gly Ser Val Cys Asp Leu Arg Thr Asn Lys Leu 

Pro Gly ser Pro Gly Leu Ser Lys Ser Met Phe Asp Leu Thr Asn 
170 xi^ !: 

ser ser Gin Arg Phe He Gin Arg His Asp Ser Leu Ser Ser Val 

pro ser Ser Ser sS Ser Arg Lys Asn Ser Gin Gly Ser Asn Arg 
200 205 ''^^ 
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Ser Leu Asp Thr He Ito: Leu Ser Gly Asp Glu Arg Asp Phe Gly 



Arg Leu Asn Val HI Leu Phe Tyr Asn Ser Ser Val Glu Gin lie 



230 

Trp He Thr Val Leu Gin cys Arg Asp Leu Ser Trp Pro Ser Ser 



235 



Tyr Gly Asp Thr l^o Thr Val Ser He Lys Gly He Leu Ito Leu 

260 



pro Lys pro Val His Phe Lys Ser Ser Ala Lys Glu Gly Ser Asn 

Phe 
295 

Lys 

305 



275 - 
Ala He Glu Phe Met Glu Thr Phe Val Phe Ala He Lys Leu Gin 
290 29b 



Asn Leu Gin Thr Val Arg Leu Val Phe Lys He Gin Thr Gin Thr 



Pro Arg Lys Lys Thr He Gly Glu Cys Ser Met Ser Leu Arg Thr 
Leu ser Thr Gin lln Met Asp Tyr Ser Leu Asp He Thr Pro Pro 
ser Lys He Ser III Cys His Ala Glu Leu Glu Leu Gly Thr Cys 
Phe Gin Ala Val Sn Ser Arg He Gin Leu Gin He Leu Glu Ala 
Arg Tyr Leu Pro S^ Ser Ser Thr Pro Leu Thr Leu Ser Phe Phe 
val Lys val Gly Me? Phe Ser Ser Gly Glu Leu He Tyr Lys Lys 
r.ys Thr Arg Leu Leu Lys Ala Ser Asn Gly Arg Val Lys Trp Gly 

Glu Thr Met He til Pro Leu He Gin Ser Glu Lys Glu He Val 
425 ^"^^ . 

Phe Leu He Lys Leu Tyr Ser Arg Ser Ser Val Arg Arg Lys Hxs 
440 

Phe val Gly Gin He Trp He Ser Glu Asp Ser Asn Asn He Glu 

Ala val Asn Gin S Lys Glu Thr Val He Asn Pro Glu Lys Val 

470 

Val He Arg Trp His Lys Leu Asn Pro Ser 
485 

<210> 8 
<211> 136 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 1832403CD1 

^Tlll Gly Ala Gly Ser Ala Ala Val Ser Gly Ala Gly ^ Pro 

Vai Ala Gly Pro 'Thr Gly Arg Asp Leu Phe Ala Glu Gly Leu Leu 
20 

Glu Phe Leu Arg Pro Ala Val Gin Gin Leu Asp Ser His Val His 
Ala val Arg Glu Ser Gin Val Glu Leu Arg Glu Gin He Asp Asn 



Leu Ala Thr Glu lS Cys Arg He Asn Glu Asp Gin Lys Val Ala 



70 



75 



Leu ASP Leu Asp Pro Tyr Val Lys Lys Leu Leu Asn Ala Arg Arg 

80 

Arg val Val Leu Val Asn Asn He Leu Gin Asn Ala Gin Glu Arg 

95 100 
Leu Arg Arg Leu Asn His Ser Val Ala Lys Glu Thr Ala Arg Arg 

110 1 
Arg Ala Met Leu Asp Ser Gly He Tyr Pro Pro Gly Ser Pro Gly 



125 
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Lys 

<210> 9 
<211> 1104 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> iaisc__feature 

<223> Incyte ID No: 19717 47CD1 

M^^'gIu Arg ser Pro Gly Glu Gly Pro Ser Pro Ser Pro Met Asp 

Gin pro Ser Ala Pro Ser Asp Pro Hbr Asp Gin Pro Pro Ala Ma 

20 

His Ala Lys Pro Asp Pro Gly Ser Gly Gly Gin Pro Ala Gly Pro 

35 

Gly Ala Ala Gly Glu Ala Leu Ala Val Leu Thr Ser Phe Gly Arg 

50 n 

Arg Leu Leu Val Leu He Pro Val Tyr Leu Ala Gly Ala Val Gly 

65 n 

Leu ser Val Gly Phe Val Leu PHe Gly Leu Ala Leu Tyr Leu Gly 



80 

Trp Arg Arg Val Arg Asp Glu Lys Glu Arg Ser Leu Arg Ala Ala 



Arg Gin Leu Leu A^p Asp Glu Glu Gin Leu Thr Ala Lys -nhr Leu 

110 11^ 
Tyr Met Ser His Arg Glu Leu Pro Ala Trp Val Ser Phe Pro Asp 

125 1-^0 
val Glu Lys Ala Glu Trp Leu Asn Lys lie Val Ala Gin Val Trp 

Leu 

Ala pro Ala Val ^^g Gly Ser Asn Pro His Leu Gin Thr Phe Thr 



100 



pro Phe Leu Gly Gin Tyr Met Glu Lys Leu Leu Ala Glu Thr Val 
155 



17 0 — . — 

Phe Thr Arg Val Glu Leu Gly Glu Lys Pro Leu Arg He He Gly 

185 190 
val Lys val His Pro Gly Gin Arg Lys Glu Gin He Leu Leu A^ 
200 ^'^^ 
Leu Asn He Ser Tyr Val Gly Asp Val Gin He Asp Val Glu Val 

Lys Lys Tyr Phe ^s Lys Ala Gly Val Lys Gly Met Gin Leu His 

Gly val Leu Arg III He Leu Glu Pro Leu He Gly Asp Leu Pro 

245 ^ 
Phe val Gly Ala Val Ser Met Phe Phe He Arg Arg Pro Thr Leu 

260 -^^^ T 

ASP He Asn Trp Thr Gly Met Thr Asn Leu Leu Asp He Pro Gly 

275 '^°0 " 

Leu ser Ser Leu Ser Asp Thr Met He Met Asp Ser He Ala Ala 

290 

Phe Leu val Leu Pro Asn Arg Leu Leu Val Pro Leu Val Pro Asp 

305 ^10 t 

Leu Gin ASP val Ala Gin Leu Arg Ser Pro Leu Pro Arg Gly He 

320 

He Arg He His Leu Leu Ala Ala Arg Gly Leu Ser Ser Lys Asp 

335 

Lys Tyr Val Lys Gly Leu He Glu Gly Lys ser Asp Pro Tyr Ala 

350 355 
Leu val Arg Leu Gly Thr Gin Thr Phe Cys Ser Arg Val He Asp 

365 *^'0 "z 

Glu Glu Leu Asn Pro Gin Trp Gly Glu Thr Tyr Glu Val Met Val 

380 *^°^ 
His Glu val Pro Gly Gin Glu He Glu Val Glu Val Phe Asp Lys 

395 
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Asp pro Asp r.ys Asp Asp Phe Leu Gly Arg Met Lys Leu Asp Val 

410 415 
Gly LYS Val Leu Gin Ala Ser Val Leu Asp Asp Trp Phe Pro Leu 

425 430 
Gin Gly Gly Gin Gly Gin Val His Leu Arg Leu Glu Trp Leu Ser 

440 445 
Leu Leu Ser Asp Ala Glu Lys Leu Glu Gin Val Leu Gin Trp Asn 

455 460 
Trp Gly Val Ser Ser Arg Pro Asp Pro Pro Ser Ala Ala lie Leu 

470 475 
Val val Tyr Leu Asp Arg Ala Gin Asp Leu Pro Leu Lys Lys Gly 

485 490 
Asn Lys Glu Pro Asn Pro Met Val Gin Leu Ser He Gin Asp Val 
500 505 ^xo 

Thr Gin Glu ser Lys Ala Val Tyr Ser Thr Asn cys Pro Val Trp 

515 520 
Glu Glu Ala Phe Arg Phe Phe Leu Gin Asp Pro Gin Ser Gin Glu 

530 535 
Leu Asp Val Gin Val Lys Asp Asp Ser Arg Ala Leu Thr Leu Gly 

545 550 
Ala Leu Thr Leu Pro Leu Ala Arg Leu Leu Thr Ala Pro Glu Leu 

560 565 
He Leu Asp Gin Trp Phe Gin Leu Ser Ser Ser Gly Pro Asn Ser 
575 580 ^'^^ 

Arg Leu Tyr Met Lys Leu Val Met Arg lie Leu Tyr Leu Asp Ser 

590 595 
ser Glu He Cys Phe Pro Thr Val Pro Gly Cys Pro Gly Ala Trp 
60S 610 °J-=> 

Asp Val Asp ser Glu Asn Pro Gin Arg Gly Ser Ser Val Asp Ala 

620 625 
Pro pro Arg Pro Cys His Thr Thr Pro Asp Ser Gin Phe Gly Thr 

635 640 
Glu His Val Leu Arg He His Val Leu Glu Ala Gin Asp Leu He 
650 655 650 

Ala Lys Asp Arg Phe Leu Gly Gly Leu Val Lys Gly Lys Ser Asp 
665 670 "'^ 

Pro Tyr Val Lys Leu Lys Leu Ala Gly Arg Ser Phe Arg Ser His 

680 685 
val Val Arg Glu Asp Leu Asn Pro Arg Trp Asn Glu Val Phe Glu 
695 700 '"3 

Val He Val Thr Ser Val Pro Gly Gin Glu Leu Glu Val Glu Val 
710 715 z-^" 

Phe Asp Lys ASP Leu Asp Lys Asp Asp Phe Leu Gly Arg Cys Lys 

725 '30 
val Arg Leu Thr Thr Val Leu Asn Ser Gly Phe Leu Asp Glu Trp 

740 745 
Leu Thr Leu Glu Asp Val Pro Ser Gly Arg Leu His Leu Arg Leu 

755 760 
Glu Arg Leu Thr Pro Arg Pro Thr Ala Ala Glu Leu Glu Glu Val 

770 775 
Leu Gin val Asn Ser Leu He Gin Thr Gin Lys Ser Ala Glu Leu 

785 790 
Ala Ala Ala Leu Leu Ser He Tyr Met Glu Arg Ala Glu Asp Leu 

800 805 
Pro Leu Arg Lys Gly Thr Lys His Leu Ser Pro Tyr Ala Thr Leu 
815 820 o2b 

Thr Val Gly Asp Ser Ser His Lys Thr Lys Thr He Ser Gin Thr 

830 835 
ser Ala Pro Val Trp Asp Glu Ser Ala Ser Phe Leu He Arg Lys 
845 850 855 

Pro His Thr Glu Ser Leu Glu Leu Gin Val Arg Gly Glu Gly Thr 
860 865 870 

Glv Val Leu Gly Ser Leu Ser Leu Pro Leu Ser Glu Leu Leu Val 
' 875 880 885 

Ala Asp Gin Leu cys Leu Asp Arg Trp Phe Thr Leu Ser Ser Gly 

Gin Gly Gin Val Leu Leu Arg Ala Gin Leu Gly He Leu Val Ser 
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905 910 . 

Gin His ser Gly Val Glu Ala His Ser His Ser Tyr Ser His Ser 

ser ser Ser Leu Ser Glu Glu Pro Glu Leu Ser Gly Gly Pro Pro 
His lie Thr Ser ler Ala Pro Glu Leu Arg Gin Arg Leu -Ehr His 

val ASP ser Pro Leu Glu Ala Pro Ala Gly Pro Leu Gly Gin Val 

965 

Lys Leu Thr Leu Trp Tyr Tyr Ser Glu Glu Arg Lys Leu val Ser 

980 

lie val His Gly Cys Arg Ser Leu Arg^Gln Asn Gly Arg Asp^Pro 
pro ASP pro Tyr Jal Ser Leu Leu Leu^Leu Pro Asp Lys Asn^Arg 
Gly Thr Lys Arg^^g Thr Ser Gin Lys Lys Arg Thr Leu Ser Pro 

1025 1030 xw-^-- 

Glu Phe Asn Glu Arg Phe Glu Trp Glu Leu Pro Leu Asp Glu Ala 

1040 1045 
Gin Arg Arg Lys Leu Asp Val Ser Val^Lys Ser Asn Ser Ser^Phe 

Met ser Arg Glu'SI Glu Leu Leu Gly Lys Val Gin Leu Asp Leu 

1070 1075 xuow 

Ala Glu Thr ASP Leu Ser Gin Gly Val Ala Arg Trp Tyr Asp Leu 

1085 1050 -"-"^-^ 

Met Asp Asn Lys Asp Lys Gly Ser Ser 

1100 

<210> 10 
<211> 411 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> incyte ID No: 2285348CD1 

M^^Ala'Asn Tyr Glu Ser Thr Glu Val Met Gly Asp Gly Glu Ser 

Ala His Asp Ser Pro Arg Asp Glu Ala Leu Gin Asn He Ser Ala 

20 

ASP ASP Leu pro Asp Ser Ala Ser Gin Ala Ala His Pro Gin Asp 



ser Ala Phe Ser ^ Arg Asp Ala Lys L^ Lys Leu Arg Leu Ala 



50 55 

Leu cys Ser Ala Asp ser Val Ala Phe Pro Val Leu Thr Hxs Ser 

Thr Arg Asn Gly lSu Pro Asp His Thr Asp Pro Glu Asp Asn Glu 
80 

lie val cys Phe Leu Lys Val Gin He Ala Glu Ala lie Asn Leu 

Gin ASP Lys Asn Leu Met Ala Gin Leu Gin Glu Thr Met Arg cys 

110 ll-> 
val cys Arg Phe Asp Asn Arg Thr Cys Arg Lys Leu Leu Ala Ser 

125 1-^0 
lie Ala Glu ASP Tyr Arg Lys Arg Ala Pro Tyr He Ala Tyr Leu 

140 T 
Thr Arg Cys Arg Gin Gly Leu Gin Thr Thr Gin Ala His Leu Glu 

Arg Leu Leu Gin S Val Leu Arg Asp Lys Glu Val Ala Asn Arg 
170 ; 

Tyr Phe Thr Thr Val Cys Val Arg Leu Leu Leu Glu Ser Lys Glu 
185 190 ■'■^^ 

Lys Lys He Arg Glu Phe He Gin Asp Phe Gin Lys Leu Thr Ala 
200 20o 

Ala ASP Asp Lys Thr Ala Gin Val Glu Asp Phe Leu Gin Phe Leu 
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215 220 225 

Tyr Gly Ala Met Ala Gin Asp Val He Trp Gin Asn Ala Ser Glu 

Glu Gin Leu Gin Sp Ala Gin Leu Ala lie Glu Arg Ser Val Met 

Asn Arg He Phe Lys Leu Ala Phe Tyr Pro Asn Gin Asp Gly Asp 

lie Leu Arg Asp Gin Val Leu His Glu His He Gin Arg Leu Ser 



Lys val val Thr £1 Asn His Arg Ala Leu Gin lie Pro Glu Val 



Tyr Leu Arg Glu Ala Pro Tzp Pro Ser Ala Gin Ser Glu He Arg 

Thr lie ser Ala ^ Lys Thr Pro Arg Asp Lys Val Gin Cys lie 

320 jAo 
Leu Arg Met cys Ser Thr He Met Asn Leu Leu Ser Leu Ala Asn 

Glu ASP ser Val '^o Gly Ala Asp Asp Phe Val Pro Val Leu Val 

Phe val Leu He Lys Ala Asn Pro Pro cys Leu Leu Ser Thr Val 

om Tyr He Ser ler Phe Tyr Ala Ser Cys Leu Ser Gly Glu Glu 

380 ooD 
ser Tyr Trp Trp Met Gin Phe Thr Ala Ala Val Glu Phe He Lys 

Thr He Asp Asp Arg Lys 
410 

<210> 11 

<211> 201 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2374186CD1 

Mt?°Phe'pro Glu Gin Gin Lys Glu Glu Phe Val Ser Val Trp Val 



295 



Arg ASP pro Arg 111 Gin Lys Glu Asp Phe Trp His Ser Tyr He 

ASP Tyr Glu He Cys He His Thr Asn Ser Met Cys Phe Thr Met 

Lys Thr ser Cys vl! Arg Arg Arg Tyr Arg Glu Phe Val Trp Leu 

50 1 
Arg Gin Arg Leu Gin Ser Asn Ala Leu Leu Val Gin Leu Pro Glu 

Leu pro ser Lys A^n Leu Phe Phe Asn Met Asn Asn Arg Gin His 
80 

val ASP Gin Arg Arg Gin Gly Leu Glu Asp Phe Leu Arg Lys Val 

95 lou 
Leu Gin Asn Ala Leu Leu Leu Ser Asp Ser Ser Leu His Leu Phe 

110 

Leu Gin Ser His Leu Asn Ser Glu Asp He Glu Ala Cys Val Ser 
125 1 

Gly Gin Thr Lys Tyr Ser Val Glu Glu Ala He His. Lys Phe Ala 
140 T 

Leu Met Asn Arg Arg Phe Pro Glu Glu Asp Glu Glu Gly Lys Lys 
155 

Glu Asn ASP He Asp Tyr Asp Ser Glu Ser Ser Ser Ser Gly Leu 

170 ■^^^ 
Gly His ser Ser Asp Asp Ser Ser Ser His Gly Cys Lys Val Asn 

185 

Tlir Ala Pro Gin Glu Ser 
200 
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<210> 12 

<211> 476 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> itdsc__featurG 

<223> Incyte ID No: 2476232CD1 

MftMi'phe Gly Lvs ser His Asp Pro ^ Ala Thr Ser Val 
Oli His I.eu lie Glu Lys Ala Thr Phe Ala Gly Val Gin 'Ttr Glu 
Trp Gly Gin vll Met His lie Cys .Asp He He Asn 0^ -nhr 
Gin ASP Gly pro J Asp Ala Val Lys Ala Leu Lys Lys Arg lie 
ser Lys Asn Tyr A^n His Lys Glu He Gin Leu ^ Leu ser Leu 
lie ASP Met cys vfl Gin Asn Cys Gly Pro Ser Phe Gin Ser Leu 
. lie val Lys Lys g?u Phe Val Lys Glu Asn Leu Val Lys Leu Leu 
Asn pro Arg Tyr Asn Leu Pro Leu Asp lie Gin Asn Arg He Leu 
P.e He Lys Trp ser Gin Gly PJe Pro Gly Gly Val Asp 
val ser Glu Val lH Glu Val Tyr Leu Asp Leu Val Lys Lys Gly 
val Gin Phe Pro ^o Ser Glu Ala Glu Ala Glu Tbr Ala Arg Gin 
Olu -mr Ala Gin III Ser Ser Asn Pro Pro Thr Ser Val Pro Thr 
pro Ala Leu III Ser Val He Ala Pro Lys Asn Ser Thr Val 
.eu val pro g!u Ha Gly Lys Leu His Ser Glu Leu Asp 
Met val Lys Met Val Arg Val Met Ser Ala He Leu Met Glu 
Asn Thr pro Gly HI Glu Asn His Glu Asp He Glu Leu Leu Gin 
I.ys Leu Tyr Lys ^ Gly Arg Glu Met Gin Glu Arg He Met Asp 
.eu Leu val Val vfl Glu Asn Glu Asp Val Thr Val Glu Leu lie 
om val Asn Glu ^P Leu Asn Asn Ala lie Leu Gly Tyr Glu Arg 
Pbe Thr Arg Asn Gil Gin Arg He Leu Glu Gin Asn Lys Asn Gin 
I.ys Glu Ala Thr i^n Thr Thr Ser Glu Pro Ser Ala Pro Ser Gin 
;^p Leu Leu ASP Leu ser Pro Ser Pro Arg Met Pro Arg Ala Thr 
X.eu Gly Glu Leu Thr Met Asn Asn Gin Leu Ser Gly Leu Asn 
Pbe ser Leu Pro III Ser Asp Val Thr Asn Asn Leu Lys Pro Ser 
Leu His pro Gin mI2 Asn Leu Leu Ala Leu Glu Asn Thr Glu He 
pro Phe Ala g£ Arg Thr Ser Gin Asn Leu Thr Ser Ser His 
Ma Tyr ASP Asn Phe Leu Glu His Ser Asn ser Val Phe Leu Gin 
P.O val ser Leu Thr He Ala Ala Ala Pro Ser Asn Gin Ser 
Leu pro pro Leu Pro Ser Asn His Pro Ala Met Thr Lys Ser Asp 
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I^u Gin pro Pro Asn lYr Tyr Glu Val Met Glu Phe Asp Pro Leu 

Ala pro Ala Val Thr Thr Glu Ala He Tyr Glu Glu lie Asp Ala 

455 ^ 
His Gin His Lys Gly Ala Gin Asn Asp Gly Asp 
470 



<210> 13 

<211> 1220 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc„feat\ire ^^^^^^ 
<223> Incyte ID No: 2503986CD1 



Mft°Ly"Leu Lys Glu Val Asp Arg Thr Ala Met Gin Ala Trp Ser 
Pro Ala Gin Asn nil Pro He Tyr Leu Ala Thr Gly Ilu: Ser Ala 
Gin Gin Leu Asp Ala Phe Ser Thr Asn Ala Ser Leu Glu lie 
Phe Glu Leu ASP Leu Ser Asp Pro Ser Leu Asp Met Lys Ser cys 
Ala Thr Phe Ser sir Ser His Arg Tyr His Lys Leu He Trp Gly 

Pro Tyr Lys Met A^p Ser Lys Gly Asp Val Ser Gly Val Leu lie 

80 

Ala Gly Gly Glu Asn Gly Asn He He Leu Tyr Asp Pro Ser Lys 
He He Ala Gly A^p Lys Glu Val Val He Ala Gin Asn Asp Lys 

Arg Ala Leu Asp 
Gly Ala Asn Glu 
Ala Thr Pro Met 
He ser Cys He 
Gin His He Leu ila Ser Ala Ser Pro Ser Gly Arg Ala Thr Val 
Trp ASP Leu Arg lS Asn Glu Pro He lie Lys Val Ser Asp His 
ser Asn Arg Met h?s Cys Ser Gly Leu Ala Trp His Pro Asp Val 
Ala Thr Gin Met vK Leu Ala Ser Glu Asp Asp Arg Leu Pro Val 



His Thr Gly pro HI Arg Ala Leu Asp vll Asn He Phe Gin Thr 



Asn Leu Val Ala Ser Gly Ala Asn Glu Ser Glu He Tyr He T^ 



130 



ASP Leu Asn Asn lil Ala Thr Pro Met Pro Gly Ala Lys ^ 



145 



Gin pro pro Glu ^p He Ser Cys He Ala Trp Asn Arg Gin Val 



160 



He Gin Met Trp 1^ Leu Arg Phe Ala Ter Ser Pro Leu Arg Val 



Leu Glu Asn His aJI Arg Gly He Leu Ala He Ala Trp Ser Met 



250 



260 



Leu cys ser Asn S Asn Thr Gly Glu Vai Leu Tyr Glu Leu Pro 



Ala ASP pro Glu Leu Leu Leu Ser Cys Gly Lys Asp Ala Lys H| 

Asn Thr Gly Glu 

290 

Ku: Asn Thr Gin Trp Cys Phe Asp He Gin ?rp Cys Pro Arg Asn 



295 



pro Ala val Leu ler Ala Ala Ser Phe A^ Gly Arg He Ser Val 

320 J/&-> 
Tyr ser He Met Gly Gly Ser Thr Asp Gly Leu Arg Gin Lys Gin 



val ASP Lys Leu ler Ser Ser Phe Gly aJu Leu Asp Pro Phe Gly 



355 



Thr Gly Gin Pro Leu Pro Pro Leu Gin He Pro Gin Gin Thr Ala 
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365 370 375 

Gin His Ser He Val Leu Pro Leu Lys Lys Pro Pro Lys Trp lie 
380 

Arg Arg Pro Val Gly Ala Ser Phe Ser Phe Gly Gly Lys Leu Val 

395 4l)U -aw-F 

Ohr Phe Glu Asn Val Arg Met Pro Ser His Gin Gly Ala Glu Gin 
410 t 

Gin Gin Gin Gin His His Val Phe He Ser Gin Val Val Thr Glu 
425 ^30 ^-^^ 

Lys Glu Phe Leu Ser Arg Ser Asp Gin Leu Gin Gin Ala Val Gin 
4^0 

ser Gin Gly Phe He Asn Tyr Cys Gin Lys Lys He Asp Ala Ser 

455 

Gin oair Glu Phe Glu Lys Asn Val Trp Ser Phe Leu Lys Val Asn 

470 ^'^ 
Phe Glu ASP ASP ser Arg Gly Lys Tyr Leu Glu Leu Leu Gly Jyr 

485 

Arg Lys Glu Asp Leu Gly Lys Lys lie Ala Leu Ala Leu Asn Lys 
^ 5Q0 505 ^-^^ 

val ASP Gly Ala Asn Val Ala Leu Lys Asp Ser Asp Gin Val Ala 

515 520 -^^-^ 

Gin ser Asp Gly Glu Glu Ser Pro Ala Ala Glu Glu Gin Leu Leu 

530 535 
Gly Glu His He Lys Glu Glu Lys Glu Glu Ser Glu Phe Leu Pro 

545 550 ^^'^ 

ser ser Gly Gly Thr Phe Asn He Ser Val Ser Gly Asp He Asp 

560 565 
Gly Leu He Thr Gin Ala Leu Leu Thr Gly Asn Phe Glu Ser Ala 

575 580 
val ASP Leu cys Leu His Asp Asn Arg Met Ala Asp Ala He He 

Leu Ala He Ala lly Gly Gin Glu Leu Leu Ala Arg Thr Gin Lys 

605 ^-^^ 
Bys lyr Phe Ala Lys Ser Gin Ser Lys He Thr Arg Leu He Thr 

Ala val val Met HI Asn Trp Lys Glu He Val Glu Ser Cys Asp 

635 

Leu Lys Asn Trp Arg Glu Ala Leu Ala Ala Val Leu Thr Tyr Ala 

650 655 
Lys pro ASP Glu Phe Ser Ala Leu Cys Asp Leu Leu Gly Thr Arg 

665 

Leu Glu Asn Glu Gly Asp Ser Leu Leu Gin Thr Gin Ala Cys Leu 

680 

cys Tyr He Cys Ala Gly Asn Val Glu Lys Leu Val Ala Cys Trp 
^ 700 '^^ 

Thr Lys Ala Gin Asp Gly Ser His Pro Leu Ser Leu Gin Asp Leu 
710 

He Glu Lys Val Val He Leu Arg Lys Ala Val Gin Leu Thr Gin 
725 '30 '^^ 

Ala Met ASP Thr Ser Thr Val Gly Val Leu Leu Ala Ala Lys Met 
740 ? 

Ser Gin Tyr Ala Asn Leu Leu Ala Ala Gin Gly Ser He Ala Ala 
755 760 '^^ 

Ala Leu Ala Phe Leu Pro Asp Asn Thr Asn Gin Pro Asn He Met 

Gin Leu Arg Asp S Leu Cys Arg Ala Gin Gly Glu Pro Val Ala 

785 790 '^^ 

Gly His Glu ser Pro Lys He Pro Tyr Glu Lys Gin Gin Leu Pro 

800 oOd 
LYS Gly Arg Pro Gly Pro Val Ala Gly His His Gin Met Pro Arg 

815 ^20 
val Gin Thr Gin Gin oyr Tyr Pro His Gly Glu Asn Pro Pro ^o 

830 ^35 
Pro Gly Phe He Met His Gly Asn Val Asn Pro Asn Ala Ala Gly 

845 850 
Gin Leu Pro Thr Ser Pro Gly His Met His Thr Gin Val Pro Pro 

860 865 o/u 
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Tvr pro Gin Pro Gin Pro Tyr Gin Pro Ala Gin Pro Tyr Pro Phe 

gtyg 880 00:3 

Gly Thr Gly Gly Ser Ala Met Tyr Arg Pro Gin Gin Pro Val Ala 
890 89b juu 

Pro Pro Thr Ser Asn Ala Tyr Pro Asn -mr Pro Tyr He Ser Ser 
905 . 

Ala ser Ser Tyr Thr Gly Gin Ser Gin Leu Tyr Ala Ala Gin His 

Gin Ala ser Ser So Thr Ser Ser Pro Ala Thr Ser Phe Pro Pro 
935 940 ^'^^ 

Pro pro ser Ser Gly Ala Ser Phe Gin His Gly Gly Pro Gly Ala 

950 

Pro pro ser Ser Ser Ala Tyr Ala Leu Pro Pro Gly Thr Thr Gly 

965 

Thr Leu Pro Ala Ala Ser Glu Leu Pro Ala Ser Gin Arg Thr Gly 
980 

Pro Gin Asn Gly Trp Asn Asp Pro Pro Ala Leu Asn Arg Val Pro 

995 1000 iuu3 

Lys Lys Lys Lys^Met Pro Glu Asn Phe^Met Pro Pro Val Profile 

Thr ser Pro Ile^Met Asn Pro Leu Gly Asp Pro Gin Ser Gin Met 
1025 1030 iuo3 

Leu Gin Gin Gin Pro Ser Ala Pro Val Pro Leu Ser Ser Gin Ser 

1040 1045 
ser Phe Pro Gin Pro His Leu Pro Sly Gly Gin Pro Phe His Gly 
1055 1060 J.U03 

val Gin Gin Pro Leu Gly Gin Thr Gly Met Pro Pro Ser P^^^ Ser 
1070 1075 xwou 

Lys Pro Asn He Glu Gly Ala Pro Gly^Ala Pro He Gly Asn Thr 
1085 1090 a.u:?3 

Phe Gin His Val Gin Ser Leu Pro Thr Lys Lys He Thr 

1100 1105 ixxu 

pro He Pro Asp Glu His Leu He Leu Lys Thr Thr Phe Glu Asp 
1115 1120 J-J.^^ 

Leu He Gin Arg Cys Leu Ser Ser Ala Thr Asp Pro Gin Thr Lys 
1130 1135 J-i^feu 

Arg Lys Leu Asp Asp Ala Ser Lys Arg Leu Glu Phe Leu Asp 

1145 1150 
Lys Leu Arg Glu Gin Thr Leu Set Pro Thr He Thr Ser Gly Leu 
1160 1165 xx/u 

His Asn He Ala Arg Ser He Glu Thr Arg Asn Tyr Ser Glu Gly 
1175 1180 XJ.03 

Leu Thr Met His Thr His He Val Sfer Thr Ser Asn Phe Ser Glu 

X190 1195 
Thr ser Ala Phe Met Pro Val Leu Lys Val Val Leu -Hir Gin Ala 
1205 1210 1215 

Asn Lys Leu Gly Val 
1220 

<210> 14 

<211> 222 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2596566CD1 

Met^Ala^Phe Val Lys Ser Gly Trp Leu Leu Arg Gin Ser Oto lie 

1 5 10 

Leu Lys Arg Trp Lys Lys Asn Trp Phe Asp Leu Trp Ser Asp Gly 

20 25 
His Leu He Tyr Tyr Asp Asp Gin Thr Arg Gin Asn He Glu Asp 

35 ^0 
Lys val His Met Pro Met Asp cys He Asn He Arg Thr Gly Gin 

50 55 
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Glu Cys Arg Asp 1^ Oln Pro Pro Asp Gly Lys Sar I.ys Asp Cys 
Met Gin He vH cys Arg Asp Gly Lys Thr He Ser Leu cys 
Ala Glu ser Thr ASP ASP Cys Leu Ala Trp Lys Phe -Hhr Leu Gin 
ser Arg Thr A^n Thr Ala Tyr Val Gly Ser Ala Val Met Thr 



^ ^ 1^1 v^i qe^r Ser Pro Pro Pro Tyr Thr Ala Tyr 
Asp Glu Thr Ser Val Val Ser £>er ^^"^ i35 



Ala Ala pro Ala S Glu Gin Ala Tyr fly Tyr Gly Pro O^r Gly 



130 



Gly Ala Tyr Pro To Gly Thr Gin Val Vai Tyr Ala Ala Asn Gly 



145 



Gin Ala Tyr Ala ial Pro Tyr Gin Tyr P^o Tyr Ala Gly Leu Tyr 



160 



Gly Gin Gin pro Sa Asn Gin Val He lie Arg Glu Arg Tyr Arg 



175 



ASP Asn ASP ser Kp Leu Ala Leu Gly Met Leu Ala Gly Ala Ala 



190 



Thr Gly Met Ala Leu Gly Ser Leu Phe T|p Val Phe 

215 

<210> 15 
<211> 924 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2685253CD1 



uits^'^^ ser Arg Gin Pro Pro Leu Val Thr Gly He Ser Pro 
Asi Glu Gly lie pro Trp Thr Lys Val Thr He Arg Gly Glu Asn 
Leu Gly Thr Gly Pro Thr Asp Leu He Gly Leu Thr He cys Gly 



His Asn cys Leu lIu Ihr Ala Glu Trp Met Ser Ala Ser Lys H^e 
val cys Arg Val oly Gin Ala Lys Asn Asp Lys Gly Asp He He 
val Thr Thr Lys Ser Gly Gly Arg Gly Thr Ser Thr Val Ser Phe 

Glu Lys lie Gly 
Met Asn Tyr Tyr . 
Pro Leu Ser Leu 
Lys Ser Lys Phe 

Met Leu Phe His cly Met Ser Ala Asp Phe Thr Ser Glu Asn Phe 

155 

ser Ala Ala Trp Tyr Leu He Glu Asn His Ser Asn Thr Ser Phe 

Glu Gin Leu Lys Ala Val Thr Asn Leu Lys Arg Gin Ala Asn 
1 S5 

nys Lys ser Glu Gly Ser Leu Ala Tyr Val Lys Gly Gly Leu Ser 



65 70 
Ser 

Lys Leu Leu Lys P^o Glu Lys He Gly He Leu Asp Gin Ser Ala 



val Trp val Asp gII Met Asn Tyr Tyr A^P Met Arg Thr Asp Arg 

Asn Lys Gly He Pro Pro Leu Ser Leu Arg Pro Ala Asn Pro Leu 

125 ju J vi 

Gly He Glu He Glu Lys Ser Lys Phe Ser Gin Lys Asp Leu Glu 



Thr Phe Phe Glu III Gin Asp Ala Leu Ser Ala He His Gin Lys 



215 

Leu Glu Ala Asp Gly Thr Glu Lys Val Glu Gly Ser Met Thr Gin 



220 225 



Lys Leu Glu Asn Ja! Leu Asn Arg Ala Ser Asn Thr Ala Asp Thr 



235 



245 



250 
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I.eu Ph. Gin Glu Val I,eu Gly Arg I^ys A|P Lys Ala Asp Ser Thr 
Ala Leu Val .eu Gin Arg Phe Ly. Phe Leu Phe Asn 
..u pro X.eu Asn fll Glu Arg Asn He Gin Lys Gly Asp Tyr Asp 
val val lie Asn ^ oyr Glu Lys Ala Lys Ser Leu Phe Gly Lys 
0^ Olu val Gin val Phe Lys Lys 0^ Tyr Ala Glu Val Glu Thr 
Xle Glu Ala Su Arg Glu Leu Leu Leu Asp Lys Leu Leu Glu 
0^ pro ser Thr Leu His Asp Gin Lys Arg Tyr He Arg Tyr Leu 
ser ASP Leu His fla Ser Gly Asp Pro Ala Trp Gin Cys Xle Gly 
Ma Gin His Lys S He Leu Gin Leu Met His Ser Cys Lys Glu 
Oly ^ val Lys Leu Lys Gly Asn Pro Gly Leu His Ser Pro 
Met Leu ASP Leu S Asn Asp O^r Arg Pro Ser Val Leu Gly His 
.eu ser Gin ^ HI Ser Leu Lys Arg Gly Ser Ser Phe Gin Ser 
Gly Arg Asp Asp S Trp Arg Tyr Lys Thr Pro His Arg Val Ala 
P.e val Glu Lys Leu Thr Lys Leu Val Leu Ser Gin Leu Pro Asn 
PUe Trp Lys Leu S He Ser Tyr Val Asn Gly Ser Leu Phe Ser 



Glu Thr Ala Glu Lyl ser Gly Gin He Glu Arg Ser Lys Asn Val 
Gin Arg Gin S Asp Phe Lys Lys Met Xle Gin Glu Val Met 
His ser Leu Val lys Leu Thr Arg Gly Ala Leu Leu Leu Ser 
Xle Arg Asp Gly Su Ala Lys Gin T^ Gly Gly Trp Glu Val Lys 
cys Glu Leuser lly Gin Trp Leu Ala His Ala Xle Gin ^ Val 
Leu Thr His gIu Ser Leu ^r Ala Leu Glu He Pro Asn Asp 
.eu Leu Gin Thr III Gin Asp Leu Xle Leu Asp Leu Arg Val Arg 
cys val Met Ala Leu Gin His Ala Glu Glu He Lys Arg 
.eu Ala Glu Lys III Asp Trp He Val Asp Asn Glu Gly Leu Thr 
ser Leu Pro Cys III Phe Glu Gin Cys He Val Cys Ser Leu Gin 
ser Leu Lys Gly vfl Leu Glu Cys Lys Pro Gly Glu Ala Ser Val 
Pl.e Gin Gin Pro lys Thr Gin Glu Glu Val cys Gin Leu Ser He 
He Met G3. V^l Phe Xle Tyr Cys Leu Glu Gin Leu Ser Thr 
.ys pro ASP Ala He Asp Thr 1^ His Leu Ser Val Asp Val 
ser ser Pro Asp III Phe Gly Ser Xle His Glu Asp Phe Ser Leu 
^ ser Glu Gin "I Leu Leu Xle Val Leu Ser Asn Cys Cys Tyr 
r^u Glu Arg His Phe Leu Asn He Ala Glu His Phe Glu Lys 
His Asn Phe Gin Gly He Olu Lys He Thr Gin Val Ser Met Ala 
ser Leu Lys Glu HI Asp Gin Arg Leu Phe Glu Asn Tyr He Olu 
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760 "^^^ 
.eu Lys Ala Asp Pr^ He Val Oly Ser .eu Olu Pro Oly He ^ 
Gly Tyr Phe S Trp fys Asp Cys .eu Pro Pro ^ Oly Val 
Xsn Tyr .eu iyl Glu Ala Leu Val Asn He He Ala Val His 
Glu val Pha He Ser .ys Olu .eu Val Pro Arg Val Leu 
ser .ys val He Ala Val Ser Olu Olu X.eu Ser Arg Leu Met 
Oln cys val Ser III P^e Ser Lys Asn Gly Ala Leu Oln Ala Arg 
.eu Olu He Cys Jla Leu Arg Asp Val Ala Val ^ ^ 
P.O Olu ser Lys HI ser P.e Lys oln Ala Leu olu Ala Leu Pro 
oln Leu ser Ser Sy Ala Asp Lys Lys Leu Leu. Olu Olu Leu Leu 
Lys P.e Lys |r Ser Met His Leu Oln Leu Thr Cys Phe Oln 

Ala Ala ser Ser Thr Met Met Lys Thr 
920 

<210> 16 
<211> 435 
<212> PRT 

<213> Homo sapiens 
<220> 

3r>Sl^l^0^o: 2762252CX>l 

^ft^M^ pro Pl.e Oly Arg Asn Leu Leu Lys OThr Arg His Lys Asn 
Arg ser Pro LyJ Asp Met Asp Ser olu olu Lys Olu He Val 
val Trp val Cys oll Olu Olu Lys Leu Val cys oly Leu .hr Lys 
A.g 0^ Thr ser £ Asp Val He Oln Ala Leu Leu Olu Olu His 
Glu Ala Thr Phe oly Olu Lys Arg Phe Leu Leu oly Lys Pro Ser 
ASP Tyr cys He ife olu Lys Trp Arg Oly Ser Olu Arg Val Leu 
P.O pro Leu Thr A?g He Leu Lys Leu Trp Lys Ala Trp Oly Asp 
Olu Oln pro Asn Met Oln Phe Val Leu Val Lys Ala Asp Ala Phe 
Beu pro val Pro HI Trp Arg Thr Ala Olu Ala Lys Leu Val Oln 
Olu Lys Leu Trp olu Leu Ser Pro Ala Asn Tyr Met Lys 
^ X.U Pro pro ^P Lys oln Lys Arg He Val Arg Lys Thr Phe 
Arg Lys Leu Ala S He Lys Oln Asp Thr Val Ser His Asp Arg 
ASP Asn Met Olu Z Leu Val His Leu lie He Ser Gin Asp Hrs 
^ lie His om o?n val Lys Arg Met Lys olu Leu Asp Leu olu 
.Xe Olu Lys cys lln Ala Lys Phe His Leu Asp Arg Val Olu Asn 
ASP Oly Glu Asn val Oln Asp Ala Tyr Leu Met Pro Ser Phe 
ser Glu Val Olu fin Asn Leu Asp Leu Oln Tyr Glu Olu Asn Gin 



18/44 



PCT/USOO/34919 

WO 01/46256 

250 2SS 
,eu Glu 7^ ser Glu Ser Asp Gly He Glu Gin Leu Glu 
.lu Arg Leu Lys Tyr Arg He Leu lie Asp Lys Leu Ser Ala 
Glu lie Glu Lys flu Val Lys Ser Val Cys He Asp He Asn Glu 
^sp Ala Glu Gly Glu Ala Ala Ser Glu Leu Glu Ser Ser Asn Leu 
Glu ser val Lys ^s Asp Leu Glu Lys Ser Met Lys Ala Gly Leu 
" .ys He His ser hIs Leu Ser Gly He Gin Lys Glu He Lys Tyr 
sar ASP Ser Leu S Gin Met Lys Ala Lys Glu Tyr Glu Leu Leu 
Ma Lys Glu Phe S Ser Leu His He Ser Asn Lys Asp Gly cys 
Gin Leu Lys Glu Arg Ala Lys Glu Ser Glu Val Pro Ser Ser 
Gly Glu lie Pro Phe Thr Gin Arg Val Phe Ser Asn Jyr 
^ Asn ASP Thr i^P Ser Asp ^r Gly lie Ser Ser Asn His Ser 
Gin ASP ser Glu 0^ Val Gly Asp Val Val Leu Leu Ser Jhr 



<210> 17 

<211> 321 

<212> PRT 

<213> Homo sapiens 

<220> 

3r>S?trS^:: 3452009CD1 

^t^Gli'phe Leu Leu Gly Asn Pro Phe Ser 0^ Pro Val Gly Gin 
Cyi Leu Glu Lys All -nhr Asp Gly Ser Leu Gin Ser Glu Asp Trp 
Leu Asn Met Glu He cys Asp He lie Asn Glu O^r Glu Glu 
Gly Pro Lys Asp Ala He Arg Ala Leu Lys Lys Arg Leu Asn Gly 
Asn Arg Asn ^1 Olu Val Met Leu Ala Leu Thr Val Leu Glu 
Thr cys val Lys Jn cys Gly His Arg Phe His He Leu Val Ala 
.sn Arg Asp Phe x?e Asp Ser Val Lou Val Lys He He Ser Pro 
Lys Asn Asn Pro H ^ He Val Gin Asp Lys Val Leu Ala Leu 
lie Gin Ala Trp lH Asp Ala Phe Arg Ser Ser Pro Asp Leu Thr 
Gly val val His III Tyr Glu Glu Leu Lys Arg Lys Gly Val Glu 
Pl.e pro Met Ala ^P Leu Asp Ala Leu Ser ..o He His Thr Pro 
Gin Arg Ser Val So Glu Val Asp Pro Ala Ala Thr Met Pro Arg 
ser Gin Ser Gin "in Arg Thr Ser Ala Gly Ser Tyr Ser Ser Pro 
pro pro Ala Pro Ser Ala Pro Gin Ala Pro Ala Leu Ser Val 
Gly pro He ^ Ala Asn Ser Glu Gin He Ala Arg Leu Arg 
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Ser Glu LOU Asp Val Val Arg Oly Asn gr r.ys Val Met Ser Glu 
Met .eu Thr Glu Me? Val Pro Gly Gin Glu Asp Ser Ser Asp Leu 
Glu Leu Leu Gin gJu Leu Asn Arg ^ Cys Arg Ala Met G3. Gin 

Xle val Glu ill lie Ser Arg Val Ser Asn Glu Glu Val Thr 
Glu Glu Leu Leu hIs Val Asn Asp Asp Leu Asn Asn Val Phe Leu 

0^ Glu Arg |p Glu pro Asp Phe Phe Phe Phe Phe Phe Pro 

Leu Lys Arg Leu Leu Pro 
320 

<210> 18 

<211> 499 

<212> PRT 

<213> Homo sapiens 

<220> 

:f23> ?icjEl^So: 4644780CD1 

rt°^o\la val ser Gly Pro Gly ^o Leu Phe cys Leu Leu Leu 

t.ei Leu Leu.Asp Pro His Ser Pro Glu O^r Gly cys Pro Pro Leu 

Arg Phe Glu Lys Leu Ser Phe Lys Gly Pro Arg Leu Ala 

Leu pro Gly Ala G^ly He Pro Phe Trp Ser His His Gly Asp Ala 

lie Leu Gly Leu Glu Glu Val Arg Leu 0^ Pro Ser Met Arg Asn 

^g ser Gly Ala Trp Ser Arg Ala Ser Val Pro Phe Ser Ala 

rrp Glu val Glu Va°l Gin Met Arg Val Thr Gly Leu Gly Arg Arg 

Gly Ala Gin Gly Met Ala Val Trp Tyr Thr Arg Gly Arg Gly His- 

val Gly ser Val ^ei Gly Gly Leu Ala ser Trp Asp Gly He Gly 

lie Phe Phe Asp HI Pro Ala Glu Asp Thr Gin Asp Ser Pro Ala 

Xle Arg Val L.u 111 Ser Asp Gly His lie Pro Ser Glu Gin Pro 

Gly ASP Gly Ala sir Gin Gly Leu Gly Ser Cys His Trp Asp Phe 

Asn Arg Pro hIs Pro Phe Arg Ala Arg He 0^ Tyr Trp Gly 

Gin Arg Leu Arg Jet Ser Leu Asn Ser Gly Leu Thr Pro Ser Asp 

pro ASP A^ His S val Leu Ser Phe Leu Thr Phe Ser Leu Ser 

Glu pro ser Pro S Val Pro Pro Gin Pro Phe Leu Glu Met Gin 

Gin Leu Arg Leu lla Arg Gin Leu Glu Gly Leu Trp Ala Arg Leu 

Gly Leu Gly Thr Glu Asp Val Thr Pro Lys Ser Asp Ser Glu 

Ala Gin Gly Glu III Glu Arg Leu Phe Asp Leu Glu Glu Thr Leu 

Gly Arg His Arg S He Leu Gin Ala Leu Arg Gly Leu Ser Lys 

Gin Leu Ala Gin Ma Glu Arg Gin Trp Lys Lys Gin Leu Gly Pro 
305 ■'■^ 
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pro Gly Gin Ala Arg Pro Asp Gly Gly Trp Ala feu Asp Ala Ser 
cys Gin Ilo Pro II" Oto P- Gly Arg Gly Gly His Lau Ser Met 
ser Leu Asn Lys ^P Ser Ala Lys Val Gly Ala Leu Leu His Gly 
om TTp Thr Leu lIu Gin Ala Leu Gin Glu Met Arg Asp Ala Ala 
val Arg Met Ala III Glu Ala Gin Val Ser Tyr Leu Pro Val Gly 
lie Glu His His III Leu Glu Leu Asp His lie Leu Gly Leu Leu 
Gin Glu Glu Leu Gly Pro Ala Lys Ala Ala Ala Lys Ala Pro 
^rg pro Pro Gly Pro Pro Arg Ala Ser Ser Cys Leu Gin Pro 
oiy lie Phe Leu Phi Tyr Leu Leu He Gin ^r Val Gly Phe Phe 
Oly val His itl ser Arg Gin Glu Leu Asn Lys Ser Leu Gg 
Glu cys Leu ser S Gly Ser Leu Pro Leu Gly Pro Ala Pro His 
pro Arg Ala ilS Gly He Leu Arg Arg Gin Pro Leu Pro Ala 
485 

Ser Met Pro Ala 



<210> 19 

<211> 879 

<212> PRT 

<213> Homo sapiens 

<220> 



-c991> misc f eatxjre 
323>^cyte ID No: 4946103CD1 

Mft°^'Ala lie Lys His Ala Leu Gin Arg Asp He Phe Thr Pro 
Asi ASP Glu Arg Leu Leu Ser He Val Asn Val Cys Lys Ala Gly 
Lys Lys Lys Lys A^n Cys Phe Leu Cys Ala Thr Val ^r Glu 
^vrg pro Val Gin vll Lys Val Val Lys Val Lys Lys Ser Asp Lys 
Gly ASP Phe Tyr Lys ^g Gin He Ala Trp Ala Leu Arg Asp Leu 
Ma val val Asp Al Lys Asp Ala He Lys Glu Asn Pro Glu Phe 
.eu His Phe oil Lys He Tyr Lys Trp Val Ala Ser Ser Thr 
Ala Glu Lys Asn a!I Phe He Ser Cys lie Trp Lys Leu Asn Gin 
Arg Tyr Leu Arg Lys He Asp Phe Val Asn Val Ser Ser Gin 
.eu Leu Glu Glu Ser Val Pro Ser Gly Glu Asn Gin Ser Val gr 
Gly Gly ASP Glu gJu Val Val Asp Glu T^r Gin Glu Leu Asn Ala 
Arg Glu Glu Gin S He Glu He Met Met Glu Gly Cys Glu Tyr 
Ala He ser Asn Ala Glu Arg Phe Ala Glu Lys Leu Ser Arg Glu 
I.eu Gin val Leu ^p Gly Ala Asn He G^ Ser He Met Ala Ser 
Glu Lys om val He Leu Met Lys Leu Leu Asp Glu Ala Leu 
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.Xu V.1 ^ 31. I" I*u Ly. ^ ser s.r ^ =lu 31U 
Met om ^ S Ly. 31n Hat ^ =ln 11. S«: olu |» 
^. HIS ... XI. f - «r ^ - -X .ys r-. 
S.r OIU n. 01„ Ph° L» V.1 His M.t «p I.™ Al« «| 

His Xle AX. OX. .!« cxy ^ Ala ser =.r Ar. .ly 
lie .lU AX. Cys AS. Al. Al. Asp Ala ^ ^ ^l" Cy «.t 
^ VaX AXa «o OXy His A=p ^ AX. V.l Ly. 

OX. OX. OX. «5 M.e s„ ASP .-u «g OX» L.U P>.. Ala Ar| 
... AXa s.r "il A^ AS. val ||. V.l aX. GXy His 

«p ax. s« HI ^? '"^ =" SI 

™, 01= s.r S n. S.r Gin AT, P|| Pro Pro II. Al. Lys 
^ „.t OX. TTP as .ys ASP jyr oxy nys Tyr ox. oly 

Tl. .ys «. ^ M.t ASP ^ -I A., ox. AT, 

ox. XX. .ys ASP P.. ox. V.X AX. jys il. .ys 

^ ^ .ys oi. S .ys .ys P^. 3iy J- »i= 

^. ^ ^ oxy s.. s.r «. AS. i.y» X... V.X 01. 

S„ s« oxy AS. S AT, ox. s» =.r S» 1^ ASP «.t 
oxy AS. S.r £^ S„ ASP «u ASP V.X AX. Asp A., ^ .y» 
P.. ASP -ys IX. Z ol. ol. V.1 L~ OX. I.™ ox. pro 

cys !,» AX. OX. I°X° ASP P.. II. SI 
OXB HIS OX. s.r S^t pro OXy Thr M.t Al. OXU AXa OX. Asp |.. 
^p oxy oxy ^ s.r AT, ox. His As. cys oxy Pro Lj. 
P,o V.X s.r s.r IS .ys ASP H.t IX. Ar| OXn M.t Hat IX. Lys 
IX. P^ A„ cys OX. pro OX. L«. As. As. H. Ala |» 
oxy ASP Ly. II. S =.r As. s.r ... Tyx «.t I-" V.l 
„st s.r His His ^ Al. ol. As. V.l Asp Pro Al. s„ 
^^S.r'^"^'^ Oly AS. v.1 .5. V.X T..: VaX Lys Ar| 
^ P.. Lys IX. ..r AS. OX. IX. Axg 01. «.t ox. oXu 
V.X Ly. IX. S« l" Ly^ =.r Lys Val oly U. L« ^» 
^ 01. P.» 01. PI.. AX. oxy L» Al. OX. s«: II. Pk. 
^ M. OX. AT, S oxy ASP ... ASP Lys »1. Tyr XX.: .y» 
II. AT, oxy val Z V.1 AS. Val OX. Lys val Ala AS. Ol. S.r 
01. Lys •«» pro S ASP V.1 V.l «.t 01. As. P.. His His 
II. Pta AX. ■J^ IS s« Arg L«. Lys II. S«: cys OX. Al. 
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735 

alu r.y. Lys olu «a Lys cl» J.ys Ty. S »«. His I-u 01» s.r 
OVr V.1 lU ^ Oly OU. pro ^ 01» Ly. L.u Hl| 

P^e Glu Oly i'l 01» "9 ^% ''^ ?M 

Giu =lu V.1 s» 3S a» ?lf 
v.1 II. Lys «o <=W -ys «u v,i .ys fys =iy |» 

^ ^ ^ ..ys v.1 ^ys »i| St 

^ Gin S'l v.1 TTP His Het 01. =lu Pbo 11. 

.1. ^ Ly. S^^ P^. 5S »■ "9 «^ HI 

Oly S„ Oly V.1 H.t GIU Phe Thr U. om A=P ^ 

ovr cys Ser Ser lie Ala Gin Ser His 
^ 875 

<210> 20 

<211> 298 

<212> PRT 

<213> Homo sapiens 

<220> 

?ic5£f S"o: 5562355CD1 

Mft°Lp°Asn Ala aiy I.ys Glu Arg Glu Ala Val Gin Leu Met Ala 

1 ^ -■ ^ -M^ ao-r His Ser Phe Leu Arg Gly 

Glu Ala Glu Lys Arg Val Lys Ala Ser Hx| Ser 

neu Phe Gly Gly a2S ^ Arg He Glu Glu Ala Cys Glu Met 

^ Arg Ala Ala Met Phe Lys Met Ala Lys Asn Trp ser Ala 

Gly Asn Ala Phe Cys Gin Ala Ala Lys Leu His. Met Gin Leu 

oln ser Lys His A^P Ser Ala ^r Ser Phe Val Asp Ala Gly Asn 

Ma Tyr Lys Lys A^a Asp Pro Gin Glu Ala lie Asn Cys Leu Asn 

Ma Ala lie Asp III Tyr Thr Asp Met Gly Arg Phe Thr lie Ala 

_^ -r-i TM= rill Tie Tvr Glu Thr Glu Leu 
Ala Lys His His He Thr He Ala Glu lie lyr 

val ASP lie Glu III Ala He Ala His Tyr Glu Gin Ser Ala Asp 

Tyr Tyr Lys Gly Itn Glu Ser Asn Ser Ser Ala Asn Lys Cys Leu 

.ys val Ala III Tyr Ala Ala His Leu Glu Gin Tyr Gin Asn 

Ma He Glu He Glu Gin Val Gly Ala Asn Thr Met Asp Asn 

pro Leu Thr Thr Ser Ala Lys Asp Tyr Phe Phe Lys Ala Ala 

Leu cys His Phe fie Val Asp Glu Leu Asn Ala Lys Leu Ala Leu 

Glu Gin Tyr Glu Met Phe Pro Ala Phe Thr Asp Ser Arg Glu 

cys Lys Leu Leu lJs Lys Leu Leu Glu Ala His Glu Glu Gin Asn 

ser Glu Ala Tyr ?S Glu Ala Val Lys Glu Phe Asp Ser He Ser 
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260 265 270 

Arg Leu Asp Gin ^ Leu Thr Thr Mat Leu Leu Arg He Lys L^ 

ser lie Gin Gly 2p Gly Glu Gly Asp Gly Asp Leu Lys 

290 "^^^ 

<210> 21 

<211> 941 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5678824CD1 

Me? Mf Ala Tyr Leu Gin Txp Arg Arg Phe Val Phe Phe Asp Lys 



Gli Leu val Lys Glu Pro Leu Ser Asn A^p Gly Ala Ala Pro Gly 

on ■"•^ 
Ala Thr Pro Ala Ser Gly Ser Ala Ala Ser Lys Phe Leu Cys Leu 

pro pro Gly He Oto: Val Cys Asp Ser Gly Arg Gly Ser Leu Val 

Phe Gly ASP Met Glu Gly Gin He Trp Phe Leu Pro Arg Ser Leu 

Gin Leu Thr Gly pS Gin Ala Tyr Lys Leu Arg Val Thr His Leu 

Tyr Gin Leu Lys Gin His Asn He Leu Ala Ser Val Gly Glu Asp 



Glu Glu Gly He A^n Pro Leu Val Lys iie Trp Asn Leu Glu Lys 
Arg ASP Gly Gly Asn Pro Leu cys Thr Arg He Phe Pro Ala He 



Asn Leu Asn Pbe III Ala He Gly Phe ^ Asp Gly Ser Val Thr 
Leu Asn Lys Gly S He Thr Arg A.p Arg His Ser Lys Thr Gin 



125 130 
pro Gly Thr Glu Pro Thr Val Val Ser cys Leu Thr Val Hxs Glu 

Thr 
150 
Arg 

170 175 
He Leu His Lys Gly Asn Tyr Pro Val Thr Gly Leu Ala Phe Arg 

1 85 J-"u 
Gin Ala Gly Lys Thr Thr His Leu Phe Val Val Thr Thr Glu Asn 

val Gin ser Tyr He Val Ser Gly Lys Asp Tyr Pro Arg Val Glu 

Leu ASP Thr His Gly cys Gly Leu Arg Cys Ser Ala Leu Ser Asp 

pro ser Gin Asp Leu Gin Phe He Val Ala Gly Asp Glu Cys Val 

Tyr Leu Tyr Gin ^o Asp Glu Arg Gly Pro Cys Phe Ala Phe Glu 

Gly His Lys Leu He Ala His Trp Phe Arg Gly Tyr Leu He He 

275 Zou 
val ser Arg Asp Arg Lys Val Ser Pro Lys Ser Glu Phe Thr Ser 



290 — 
Arg ASP Ser Gin Ser Ser Asp Lys Gin lie Leu Asn He Tyr Asp 



^ 7<^r. TWO Tie Ala Tvx Ser Thr Val Phe Glu Asp Val 
Leu cys Asn Lys Pne i±e Axa xyx ^30 



310 



320 

val ASP val Leu Ala Glu Trp Gly Ser Leu Tyr Val Leu Thr Arg 



325 



335 

Asp Gly Arg Val His Ala Leu Gin Glu Lys Asp Thr Gin Thr Lys 



340 



Leu Glu Met Leu Phe Lys Lys Asn Leu Phe Glu Met Ala He ^n 



355 



Leu Ala Lys Ser His Leu Asp Ser ^p Gly Leu Ala Gin He 



370 



24/44 



PCT/USOO/34919 

WO 01/46256 

385 

P^e Met Gin Tyr lly Asp His Leu Tyr Ser Lys Gly Asn His Asp 
Gly Ala Val Gin gS Tyr He Arg l^hr lie Gly Lys Leu Glu Pro 
ser Tyr Val He t^l Uys Phe Leu Asp Ala Gin Arg He His Asn 
r.eu Thr Ala Tyr Su Gin Thr Leu His Arg Gin Ser Leu Ala Asn 



Ala ASP His Thr Leu Leu Leu Asn ^s Tyr Thr Lys Leu Lys 



ASP ser ser Lys S Glu Glu Phe He Lys Lys Lys Ser Glu Ser 



^ t^^ Tr=»i mil Thr Ala lie Lys Val Leu Arg Gin 
Glu val His Pile Asp Val Glu Ttir Axa xxe uy 



Ala Gly Tyr Tyr Ser His Ala Leu Tyr Leu Ala Glu _Asn His Ala 



490 



His His Glu Txp ^ Leu Lys He Gin Leu Glu Asp He Lys Asn 



520 



Tyr Gin Glu Ala Leu Arg Tyr Ha Gly Lys Leu Pro Phe Glu Gin 
Ala Glu ser Asn Me? Lys Arg Tyr Gly Lys He Leu Met His His 
lie pro Glu Gin ^ Thr Gin Leu Leu Lys Gly Leu cys Thr Asp 
oyr Arg Pro Ser Leu Glu Gly Arg Ser Asp Arg Glu Ala Pro Gly 
cys Arg Ala Asn S Glu Glu Phe He Pro He Phe Ala Asn Asn 
pro Arg Glu Leu lJs Ala Phe Leu Glu His Met Ser Glu Val Gin 



pro ASP ser pro III Gly He Tyr Asp Thr Leu Leu Glu Leu Arg 



Leu Gin Asn Trp 2a His Glu Lys Asp P^o Gin Val Lys Glu Lys 



I^u His Ala Glu S He Ser Leu Leu L^^ Ser Gly Arg Phe Cys 



640 



655 



ASP val Phe ASP gs Ala' Leu Val Leu Cys Gin Met His Asp Phe 
Gin ASP Gly val Leu Tyr Leu Tyr Glu Gin Gly Lys Leu Phe Gin 
Gin He Met His His Met Gin His Glu Gin Tyr Arg Gin Val 



lie ser Val Cys gIu Arg His Gly Glu Gin Asp Pro Ser Leu Trp 



Glu Gin Ala Leu llr Tyr Phe Ala Arg Lys Glu Glu Asp Cys Lys 
Glu Tyr val Ala S Val Leu Lys His Il| Glu Asn Lys Asn Leu 
Met pro pro Leu leu Val Val Gin Thr Leu Ala His Asn Ser Thr 
Ala Thr Leu Ser Val He Arg Asp Leu Val Gin Lys Leu Gin 

IlG Ala Gin Asp 
^ Arg Glu Glu Thr Arg He Arg Gl^ Glu He Gin Glu Leu 



715 



r.ys Gin ser Gin Gl^ He Ala Gin Asp Glu Leu Arg Val Arg Arg 



.ys Ala ser Pro lys He Phe Gin Lys Thr Lys cys Ser He Cys 
Asn ser Ala Leu "lu Leu Pro Ser Val His Phe Leu cys Gly His 



ser Phe His Gin His Cys Phe Glu Ser ^ Ser Glu ser Asp Ala 



ASP cys pro Thr Leu Pro Glu Asn A^g Lys Val Met Asp Met 



850 



lie Arg Ala Gin lln Gin Lys Arg Asp lJu His Asp Gin Phe Gin 



865 



875 
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His Gin Leu Arg Cys Ser Asn Asp ge ser Val He Ala Asp 
^ Pl.e Gly Arg lly Val Phe Asn fys Leu Thr Leu Leu ^ Asp 
P.O pro Thr Ala |g Leu Tl^ Ser Ser Leu Glu Ala Gly Leu Gin 
ASP Leu Leu |t His Ser Arg Arg Gly Thr 

<210> 22 

<211> 336 

<212> PRT 

<213> Homo sapiens 

<220> 

3l3:?iS^?xn^o: 5870.62Cni 

S°Sef Lys ser Val Pro Ala Phe Leu Gin Asp Glu Val Ser Gly 
ser val Met Ser Va! Tyr Ser Gly Asp Phe Gly Asn Leu Glu Val 
r^ys Gly Asn lie G?n Phe Ala He Glu Tyr Val Glu Ser Leu Lys 
Glu Leu His val Phi Val Ala Gin Cys Lys Asp Leu Ala Ala Ala 
;.p val Lys Lys oln Arg Ser Asp Pro Tyr Val Lys Ala Tyr Leu 
.eu pro ASP Lys G^ly Lys Met Gly Lys Lys Lys Thr Leu Val Val 
.ys Lys Thr Leu A^n Pro Val Tyr Asn Glu lie Leu Arg .Vr Lys 
lie Glu Lys Gin III Leu Lys Thr Gin Lys Leu Asn Leu Ser lie 
^ His Arg ASP Phe Lys Arg Asn Ser Phe Leu Gly Glu Val 
Olu Leu ASP Leu Su Thr Trp Asp Trp Asp Asn Lys Gin Asn Lys 
cm Leu Arg Trp ^ pro Leu Lys Arg Lys Thr Ala Pro Val Ala 
.eu Glu Ala Glu S Arg Gly Glu Met Lys Leu Ala Leu Gin Tyr 
val ^o Glu pro pro Gly Lys Lys Leu Pro Thr Thr Gly Glu 
val His lie rrp Ja! Lys Glu Cys Leu Asp Leu Pro Leu Leu Arg 
Gly ser His Leu Ser Phe Val Lys cys Thr He Leu Pro Asp 
0^ ser Arg Lys fe^ Arg Gin Lys O^r Arg Ala Val Gly Lys Thr 
^ Asn Pro He III Asn His ^r Met val Tyr Asp Gly Phe Arg 
pro Glu ASP Leu M^t Glu Ala cys Val Glu Leu Thr Val Trp Asp 
His Tyr Lys Leu Asn Gin Phe Leu Gly Gly Leu Arg He Gly 
Phe Gly Thr Gly Lys Ser Tyr Gly Thr Glu Val Asp Trp Met Asp 
ser Thr Ser Glu III Val Ala Leu Trp Glu Lys Met Val Asn Ser 

pro Asn Thr Trp IH Glu Ala Thr Leu Pro Leu Arg Met Leu Leu 

320 -^^b 

lie Ala Lys He Ser Lys 
335 

<210> 23 
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<211> 163 
<2X2> PRT 
<213> Homo sapiens 

<220> 

?223:^c^?XD^o: 28X8605CD1 

mT^'asp Lys Glu pro fys Arg ^ Pro Thr Leu Arg Asp Arg 
I^i cys ser Asp GlJ Phe Leu Phe Pro Gin Tyr Pro He Lys Pro 
Tyr His Leu Lys Gly He His Arg Ala Val Phe Tyr Arg Asp Leu 
Glu Glu Leu Lys Phi Val Leu Leu thr Arg Tyr Asp He Asn Lys 
^g ASP Arg Lys Glu Arg Thr Ala Leu His Leu Ala cys Ala Thr 
Gly Gin pro Glu mS Val His Leu Leu Val Ser Arg Arg cys Glu 
I,eu Asn Leu Cys ^ Arg Glu Asp Arg Thr Pro Leu He Lys Ala 
val Gin Leu Arg G^ln Glu Ala Cys Ala Thr Leu Leu Leu Gin Asn 
Gly Ala ASP pro He Thr Asp Val Phe Gly Arg Thr Ala Leu 
His ^ Ala val Asn Glu Asp Thr Ser Met He Glu Lys Leu 
I.eu ser His Gly ^ Asn He Glu Glu cys Ser Lys Val 

1 XOU 



155 

<210> 24 

<211> 1983 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 

<223> incyte ID No: 381039CB1 



gSgg^g ccgggcgtgc ggaggcggct ^tgaggcgg ^glg^- '120 
gggtlattgc tgactagcgg p^ggg^g^g gg=^g^g^^| gcgiaggcgg cctcggaggt 180 
IccgSgcccg ggcggcgggg tcagcggggg Hlf^lll^ IH^ILII gccactgctt 240 
gtggtgccgt =gggtg°ggg lUl^lH ggltatcaco gtgilcagct tcgtgtgcac 300 
cgagtgcgcc cagcgcgggg tcacccacgc 93^^^ >^ catgtcaagt caatctccat 360 
cacctgcLc ggcctcctga g^ggg^tg-^ cctgcaatS cgt|gaaatg aggtttgccg 420 
gacaactttc actgagcctg aagtagtatt ^IIHH gtaccagatt ccagggatcc 480 
laagatttgg ttgggtctgt ttgatgctcg g^^^^^^^^ lagagatggt atgtcccccc 540 
tcalaaagtg aaggagtttc tccaggaaaa ^^atgagaag ^agag^ jg ^^^^^^^^^ ,oo 
agaccaagtc aaggggccca cttataccaa ^ggcag^g^ cctgcaccgt ctctctcagt 660 
citcccagaa gggaagcccc ttcggacact tctgggtgat J^^^^ cccaggcccg 720 

tgctgcctcc acctcgagcc agcccgtcag l^^Hl Ltactgacc tgctggctga 780 
gigcactcag ccacctcccc actcctctgt °aaa^gcc agt 9 cattccctgc 840 
catcggtgga gacccctttg ctgcacccca g^tggcacca ^ ^ I ttagcagtgg 900 
ctttgggggc cagacacctt cccaaggagg ftttgccaac "^3^ ^ tccaggccca 960 
ccccSctIt tctgtgtttg gaagcctccc tccagctggt caagcctcgt tc sg^^^^ ^^^^ 
gccaactcct gcagggagoa gccaggggac tccatttggt 3cc^ tgcccgctgc 1080 

cagtcagcoa aacagcctcg cagacgtggg ^^^="^^^0 cicccgctcc agtctgtcac 1140 
aggtgttcct agcagcctct tcgggatggc tgg^^^gg^ ggagclttca ctaacccttt 1200 
gltgggcggc ggcggcggca goagcacagg fHHll^l ccgttccagc ccaatggctt 1260 
laclgctccc gccgcccagt ccccgctgcc ttccaccaac ^^gt 9 ^gcc 1320 

ggcgccaggg cccggctttg ggatgagcag tg^^gggcct gg^ agaccccgct 1380 

^Lcactggg gcctttgcca gctccttccc agcaccgctg ttccccc^g^ t^ggcagag 1440 
tgttcagcag ^agaatggct cttccttcgg ggacttagg^ atglctggac cotcatcaag 1500 
gccactgagc cagccagctg ggacctccac caa<- 
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cccattccc tccaaacctc caaccaccaa ccccttcttg tagcactgtg ttttgggg jseo 
gcctcttccc tgccttctgg ggcccctctg ^tccctagag ^^t ggggcctaca 1680 

tgggcatttc tatgggcctt gggg^tggtg g^ggtgctaa tgctttgc^^ ^I|?cacccc 1740 
ggtgaaaggt ggctgccctc agattccaca aagcctctct agtgatggtt 1800 

caJcaggca ggaagcccag gagg^gtgcg mH^lf ||^^g?St ccctgggcct 1860 
gagggcggag gatttttttc acatgatcag t«°«9^g gggaaagtag gggaaagatg 1920 
3»g tfXeSc SSS ^S&l ?fccagLgg ccoatgtggc 1J80 



caa 

<210> 25 

<211> 1559 

<212> ENA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No: 383249CB1 



:cca;cgaga gatctgcagc taggctggct gcacttgctc cac^g^cag ggg^tcg^ 120 
ggggattgaa gaatgcgcca ttaaaaggaa agatcaagga gt^aaccaga ^g 3 
allgagglct tcaaagctgg gaaggatgag ttcttgcagc ^acgtctgtg gg^ 
ggcacaggct gcagctgagg gtgsttacca g^g^^^gga g^cgg ^ tccaaag 300 
gttttatgag gactgtacag ^^^^^atttg ^|9tatgag 9 5 gtacagtttt 360 

atcacctaac aggtggagct cagtattctg S^^ggccgga aaatcgaagc 420 

tgtgatcctc ggattgactg ttctggcagt gggctnccnc S9 acagtgctct 480 

atttggcgaa gccgattttg tggtggtcga -^^acatgct gtccagttt^ ccatg§cagg 540 
ggacatgtac aagctggcag g^g^tgttct =ttctgc^c ^^^l^^^l^ tcctccagca 600 
gtgcctgctg atgtcggtgt ttgtaaagag ^^^^^^^^ cagccggtta caaaagctcc 660 
gaagtttaaa gaacgaatcg cagacatcaa ^gcccacacc cagccgg^ tccagcctct 720 
agglccaggg gaaacaaaga ttccagtcac t"g°«g ?tga???^tg cccgtggtta 780 
actggcaacc tgaaaccttt c-^--^^^^^ agatttggcg tSStgccc tagggctgtg 840 
aaaagagcag gccagttttc S^gataaaga ^g^|^^g^9 caittcaggc agctctgctg 900 
ttcagctgtg ggcaatataa tggftggact ^^cacctgcc 3 ^^.^ tgggtgcttc 960 
gagggcggtg ccatgcctaa Qcctgtgcac |tgcarccag 3 gactttgtgt 1020 

cttgtacccg gccaccagaa aacccctgga ^ctcctctcc | « tggotaatgt 1080 
tattttccgt gttgtttgaa agtgccgaac ^gtttagacc agotcac ^^^^^^^^ ii40 
ggtccatttt attttctaat attggtttat ^t^atgtttt ctgcgcgg g ^^^^^^^^ ^200 
ttoctgtctc actatgtgta agattttgtc ^tatgtgttc ataga ^^^^^ttgga 1260 

gattttgagc cgaacatgag tttttagact gtatcttgac ^^g 99 cctttgagat 1320 
tacaaagtta tcagcattca caaatctttt ^3"""°^ Stcatagcg cactgcagcc 1380 
gaagtgctgt tcttttgcct aggttggagt acagcggtga ^^^^^^1 gggactacag 1440 
?tglactclt gggctcaagc gatcctccca cctcagcctc ccaagtagct ggg^^^^ 9 

?Sc2lttt l^lll^ gctSStS Ltct?gIL aaaaaaaaa 1559 



<210> 26 

<211> 1643 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 618769CB1 



<400> 26 -aartoartfTHc tactaoctat ttttccaaaa 60 

gaaatcatgc ccctcgtaga gcagcaggtc caagcagggc ^gccgg ttttgt 120 

Igtgaggcag tttaaaaaaa aggcggagaa ^tagaattat ^g^t^g^f tgtgttttic 180 
gtatttgtaa aactaacggc ttgcatggtt cacaacccat ttcttatgcc ^9^9 ^40 
ttggcagcaa aatttctgtg gttcctccta ctccacctcc tg^f^gcgag ^ 300 
gccgcagtcc tggcaggaag gtcagtgcac cagatattct g^^ctctc 
Itcccalatg ctctactaac cctattttga agcaacagaa tctcccatcc ag 9g^^^ 

SSS^ gggcS??? S» « gaaiag^ ^ 
SS^Sga ^aS^alSl alSS^?! S^SS^ ScftlcctJ ttgcccgaca 600 
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gatcctccca gggaaagcgc aa^actggag tccaaaggaa caccgtggac cgacctttc 660 
aggagacctt gaagtatcag gtggcccctg ^ccagctggt 9^ « « atcattcctc 780 
cggtgtggca tctgggcacg ctggcccgga g^gg^tct tggaga g^a ccgctccggg 840 
tllccacgtg ggactttgaa g^^^gcacaa ^acagtcctt ^^g^g^^c acagtccggg 900 
ccaaggcgga gaaatacgaa gacagcgttc ^tcagagcaa « ^ gggacagatc 960 
ctaalctggt tctcccttca cggcccagaa ^^^^ gccaaglat ?tLctgtgc 1020 
agccatcact tcatggtcaa ctttgtttgg ^agcgccagg ^ gaccaacaaa 1080 

ggccagatgg caccttgaac tcatttgtta ^gggctgtct ^ac |^ aaacactcaf 1140 
lactgagact gaagtcgcca gt^ctgagga ^g^^99^"| gagctt|gag ttaactgtct 1200 
ttgtcttcag tggegtaacc ccagctcagc tg^ggcagtc S^g ay cttggttcaa 1260 
gglatcaggc cctctttgga atgaatgacc g^ttgtgg tggcagaaag 1320 

agggagacac agctgttggc gggg^tg^at g^tcactatc ^ catgaaggcc 1380 

tcctttccag ccccaatcta tggacagaca tg^ctcttgt « gctacgaacc 1440 

tcaaggttcc aggttgcagc aggcgtgagg ^^ctgtgcgt ^tgc g |ga » ^ ^^^^g. ^500 
aggtgcaggg tcccagctgg ^gaccccttt g^^ttgagc ^^ ^^^^^ aattatgtta- 1560 
r/taaSlfc SSSS S?fcfct?t SStgtaga alatggccag attttaataa 1620 
acgttgttac ccaaaaaaaa aaa 



<210> 27 

<211> 1613 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature „^„„„, 
<223> incyte ID No: 1234837CB1 



<400> 27 t.„r,rrt- aaorttittca accgcaagaa gcagagccgc 60 

cccgggccag gggcgggcgc cgccatgggt ^^e!^"cg 9 J 9 «^ gctgaggcag- 120 
gtcacggagc aggacaaggc catcctgcaa =tg^9°^^^ ccctigcocg gcagctgctg 180 
taccagaaga ggatcgccca scagctggag ^g°g^9°9°| Igaagcgata ccaggagcag. 240 
cgggacggca ggaaggaacg gg^^^^^^g ^tS^tcaaga ag 9 9^^^ gagtattgag 300 
ctcctggaca ggacggagaa c^agatcagc ^gcccggagg gtgtctgaac 360 

ttcacccaga tcgaaatgaa agtgatggag g99ctgcagt "99 gacgcaggag 420 

aagatgcacc aggtgatgtc cattgaagag 9tggagagga 9S tcaggaggat 480 
gccgtggagt accagcggca aatagacgag ^tcctggcag 9^9^^ gctgccagag 540 
?aa|aScca tcctggagga gctg9cgca atcactcagg aacaaa^^g^ ^^.eaggccc 600 
gttccctccg agccccttcc tgagaagacc ^«agaa y tcttgtggga ctcacgggga 660 
Iggcaggcgg agctggtggc agcttcgtaa ^g^gf^^^cg 111%^%^ accgggttcc 720 
tgccccaggg actgtggccc acagagagtt tgggcacgg 9 attgcaggag 780 

ctggagccca gtgcgcacgg tgc*^9agcag agctgcagcc 9 actggcctgc 840 

gactccagag cgtctcctgg agaccttgag ^ctgaacgca ^^^agg a 900 
?ctcagtccg gattaactct cgaccgagcc cagcttctgc cggtg^ggg ^^^^^^^.g^ ,^0 
ggccgaggcc caggcccaac gcctctggtg ^cgcccccc « « ttcctcttgc 1020 

Itcgcgctcg tctgtgagga agacacctcc agaccttggg 9^|^ cttggcctgg 1080 

tcctcgctgc tcccattagc tggtgcaggc ttccgccaag 9ga cagcatgtgc 1140 

cttcciggog cacctcagct tccctgctgg tgg9gg9atc ^^^^^^ag | ^ggc 1200 
tgaacctctc tgtgcctctg cctccgcacc ctagacaccc a^c a gccttgccac 1260 
Scagggg ctttcttgct gaattgacga ^tcc^gagc ^|tgactccc 9 9 ^^^o 
tclcggltct gtccactagg g^^tcagccct g^tgagaaag gacctccga^ |gctg|ggct 1380 

IZ^l ISISISI |g ~3 

MS issss ss| 3 g ?H£s IMS 

ctcgaccccc agacaggccg gcccgtatta aagttggccc cgca g 



<210> 28 

<211> 2810 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc„feat\ire 

<223> Incyte ID No: 1607223CB1 

<400> 28 
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taca.gttgg gcattgct.g gcactgat. tagg^cgcct e tatct cccatgggag 60^ 
tgatgaccag- tcccagaggc ctgtgtgact ^tgaatctcc ^ gagggaggcc 180 

tcccttctga aagcccaccc atgggcttct 99=«°°^9" t^tgcaca ggtgctggaa 240 

sssi ^^'^ s™ ~ sbs Sis? 

.tlg=ag.gg cotaocjga.: gctj™«g ^SStS «ttoalg.t 540 

^^"S? s^ss |S|it cSca|=- sr^s 12s 

sfi = 
iss? g«s ss?s g g ^ - 
«s ydg^si sisq sss»| i^^ 
ssss i»s s« I ig ISii ~ isss 

gcctcagttt ccctgcttga cgacgagctc atgtctctgg S^ctcagtga ^ c ^^^^ 
Icttcaggcc caagcctgga tggtaccgga t^g^c^gct t^cagtcgtc 3 ^^^^ 
gagcccccag cccctgctct gg^^caggcc ^^^^Qtatgg ^^^fccgacc ^^^^^^^^ ^330 
acatccctgc cagcaagcag cggtctggac gacctagacc ^cotgggg « j^jgo 
cagcagtcgc tgcccccgga atcccagcaa gt^^ggtas^ aSactcccc cagctccagc 1440 
cggctcacac tccgggacct gcagaataag ^f^agcagct g^^gctcccc ^ag 9 ^^^^ 
gccaccagcc ttctccacac cgtgtcccca gagcccccca g5°^tccg^ |^ ^ggQ 

ccaaccgage tctcactggc cagcatcact ^t^c^^^gg agtccatcaa acc g ^^^^ 
atcctgcccg tgactgtgta tgaccagcac gs^ttccgca t^^tcttcca | |« ^ggo 
gacccactgc cagggcgctc cgacgtgctg gtSSt^S^gg "tccatgcc 9 | " ^740 
ccccagccca tccgcaacat cgtgttccag t^g^g^cc ^caaggttac g^gg^S^^s ^^^^ 

r«? iss^ii s |gt gc.^ Ills 

iSftSS S« ggg M» ~?| ISIS 

iss « IS S SI ^« i"S 
^sSc stSo sEss SHs £SS 

SSSJ ^SISSI SSS^P Sfo g S g t«tgga |,J0 

aggclcagcc ataactcggg ggccatgctg gagctgggg^ lUlUl^l ctggtgtgtg 2460 

isiSgS Sill 1 Sii i?;s 

-S' "t S 13 gJgSg gtg.gtg.g. 

gcaggagtat ttatgaaaat aaaacgtcgt ttttcctgga aaaaaaaaaa 



<210> 29 

<211> 2321 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> niisc_feature 

<223> Incyte ID No; 1621554CB1 



j"^Sg. gtcgaggggg gatooogact ccjgtcogg l^SlI 1°" 

s= sssss aJS| |gg c s«s 

USSS? ^iJlTcS^S 21^" ?ISSJ |gg III 
ksSS ^^SSgS? alSSSlK aSSSS SSS |iW 4|o 

sis ?ii?srtS s^ss? fgSs; S.0 
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gagttcatt. agaagcgacg acaaggctg cagcacttcc ttgaaaaggt cctgcagagt 600 
gtggttctcc tgtcagacag ccagttgcac "attcccgc | ^ cattcttcga 720 
gaStagaag cctgtgtcca 999=^9^1^ aalaaaagS agagctcttc tcacctggct 780 
tatgctatgt caaactgtgg ^tgggcccag ^^aga^ggc ^9^^ g^gctctccc 840 

aaaggagacc agcctaagag ttgctgcttt ^tt^^f^^ g^gctccagt tgttgactct 900 
tcaccgcctc ccagtgaaga faaggaccat "^gaagtgt f S atgctgtgat 960 

gaggttcctt ccttggaaag t'^^'^actctc c^acccctct = tgtgggagga 1020 

tttggaagac ccaaagaggg aacctccact ^"cagtccg ^9 gtgagctctc 1080 

gatcatgctg tgcctttgga ccctggtcag "agaaacag ggtgggactg 1140 

ggttctgctc tgagatggtc agagaagatg ^gggccagga S^^ttactca ^^00 
ggcacalggc aggtatgtgg gag^^tgggc tgcttagtgt ^ttctag ^^^^^^^ ^260 
ggctgattga cagaggtcag tcattacagc cccttatgcc ^^cccca gy « ^320 
tItgSgatg tttgtaagtt aaacataa^ ^lall^l Satgtaggi ecaggagtat 1380 
tltlttiictc tcctgggatc tgagtttctg c^g^tcattt Sgjgragg 3^440 

ctcctcaggt gaccagtttt gg9|^=^^9^ ?cccclt^t ggScttl agtttttgct 1500 
catcatccca ctgggagtgg ttatgttgta t^^ccatctc 99 ttgcatggac 1560 

gtagccctag agcactttgt ttgtgggagg ct9gcctctt 9^= accactgaat 1620 

Igggggatga atatttactt tcccacctcc ttgctttttc tttcac g ^ggo 
gglactggtg ctgtgactcc tgctgctggg 9atttatgtc ^cgag « tggcc 1740 

agtggagcct gagacctgca caacagctca tggtcatgca g^ga^g ^ ^^00 
acagccagag ggaacagtaa cagcccaggg g^^tttattt ^9ggaa gg « ^agag 1860 
Sttactgtc tcttctggtt ataaagcaga catgtggcca tcttttccg^ ^?^^etccag 1920 
tgggctcctt tctttttgga atccttttct tctcctttgg 9 | ^gggctttgt 1980 
ggcttccgcc accagcgtct ^tgctgtgtt 9cgcagtg « ^ cgctctcctt 2040 

ttctgcctgc ctgaaagaga g99ctctggg 9atggagatg |9aaa ^ gagccagagc 2100 
cagacaatga ggcattctgt cctcctgctg ^cattctcca ^ tgtgtgtgtg 2160 

tggtaggagc cgagtgccac aggcattctg ^attgctcta ^ ^ tgtctgtggg 2220 
ft^ftS^t ScSc^r. gctgSSc SSla tttagttcag agtggagggg 2280 
cttSc^tg aaatLaatg caagtattta aaaaaaaaaa a 



<210> 30 

<211> 3581 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1751553CB1 



lZ:il,o gggcggccag cgggcgttgg gacgtggaga cccggagagg gcggcggg- 60^ 
ccqagcgggg aaaggcgcgg cgtggcttgg ctcaggtgcg ^^^^^ aagattaata 180 

gctcagSIt gtggggggcg cctgtgaggg cgcccgccgt 3^99^9^99 gg^^^^^^ ^40 
ttaagattgg aagtttgtgt cttttgctgg atattgg^a ^^9 9 acaacttttc 300 
atttitaaag agttgctgtg gaggatgttt =tatggtga^ aaS^acta tctctgtacc 360 
tgtggaaaga gattttaaag cagcagtccc aaatagtcaa aa g ^tttgctttc 420 

tlclttgact tctgtttctg taaagcctca g^tggctgt actgagg ^tatctta 480 

caaattacca tctgatggca aagaagtacc atttgtggtg 9 t^tgccag 540 

cattcaaccc aggacacaag aaactccttc acatctggaa 9 | gacctagcta 600 
agcatctttt ggagatcgaa aggtagaact "ccagttca « 9 « t^^^gctt 660 
tgatgtgtat aacccattct atatgtatca 9cacattcca 9 « ggacgaacaa 720 
tcctccccgt tcagaagtga cgagactgta tggatcggtt ^gg ^^tctcagcg 780 

acttcccggt tccoctgggc taagcaaatc tatgtttgat «ta ^^gg^aaaa 840 

attcatccag agacatgatt cattgtccag ^tacccagn ag aaagggactt 900 

ttctcagggg agtaacagaa gc^tggatac aattactcta ^^|9g 9 9 tcacagt 960 
tgggagactg aatgtgaaat tgttttataa ttcttcagta gaa= « ^ ctgtttctat 1020 
ttfacigtgc agagatttaa gttggccctc tagttatg^ IctSgcca a^aaggttc 1080 
aaaaggaata cttacattgc ccaaaccagt g^atttcaaa l^^^ atctacaaac 1140 
caaclctatt gaatttatgg aaacgtttgt atttgctatt aaactccaaa a ^^00 
tgtalgactt gtatttaaga ttcaaaccca ^ft^^^^? teSttgg atltalcacc 1260 
ctcaatgtca ctcagaaccc ttagcacaca 9g^^|g^^ gggacttgtt ttcaagcagt 1320 
accttcaaaa atttctgttt gccatgcaga actt^attg 999 » catcaacacc 1380 
aaatagcaga attcagttac aaattctt^ ?Itatttagc ?cgggagagt tgatttataa 1440 
tctgactttg agttttttcg tgaaggtggg aatgtttagc ^^ggg g^ agactatgat 1500 
gaaaaagaca cgcttactga aggcctccaa t9gaagagtc aag^ggg » ^ogaag 1560 
ttttccactt atacagagtg aaaaagaaat tgtttccctc attaay 
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ctctgtaaga agaaaacact ttgtgggcca oat^^^ta agtgaagaca |taataacat 1620 
tgaalcagtg aaccagtgga aagagacagt ^^aaatcca g^9| « cttgacattc 1740 
gcacaaatta aatccatctt gaagacttca ^acattaatc Jg^^ « g^caaatttg 1800 
?tttagaaga cttatgattt ^aatttgcta |atlgaaaat ttgataagaa I860 

tcaatttatg ggggctataa ttatggtata ^aatgcac 9 « tgcaacaaga 1920 

aatgtaatga attttatcag atatccaaag ^^^H Saaattata aagtctgtga 1980 
gaclcagaca gtaaaatcaa agtattatta H^l^ct^ gcttttgtat tttaagcttt 2040 
laatatcaac catagatagt tctttctata "atgctcc ^ ^ atttagaaga 2100 
acttagatat tcaaaacctg gtatatcaag tctctgttag tacca gg^ tgtttacctc 2160 
ctttaccatt atttcagtgc taggcattat t^^ttaggtc ^^99 ttctatattt 2220 

ttgctatgta ttttctcccg ^taaa^tga attgaaccat tt^ ^^^^^ acattatccc 2280 
ggagaaagtt tgtgccctgt gttttataat aactacattt tggcaggaat 2340 

tttitaagct acttatctcc aaaaaacttc ^gaaategaa tactttgctg 2400 

aattgaaaac accagaaggt tg^agtttaa ttggaaaccc ag^^^ gagtatcatc 2460 
ttttcttccc tcaaatattt tactatttgt "tattcgg Itagaaaatt cagtatttta 2520 
catatggtcc atcctaattc acagaat^ ^ttagl^S? tctetaattt accgtgtttg 2580 
tgataatcac ttcgttttta gtttttaaaa "^=9^^. tgtgtaacaa atattttcta 2640 
agtattttct catttttttc ataaccatac ^tgattatac ^^J'^ tcagcttaat 2700 

ttgcagtttt ctttccagta cttattagaa ^tcagtaccc 39 acattatacc 2760 

tg=[ccltaag aactgtggcc aaaaagaaca gttttttgga S^agg ? | ^tgtgttctc 2820 
tlattttaga aaatctcact ttatttttgc taataagtag « agaaagatgg 2880 

aitcttccct ttttttctgc ccccattctt |^tttgtccc ^gg y^^^ ttattccagc 2940 
tgatatttta ggccaggagt ataccttgct ^taacccaag « ctcttttccc 3000 

tcctatgttc tgtgtatatc ftaacattt tcccaaataa acactc^^^^ gcttctactg 3060 
taaataccat ctcctcaagc tacaaaatgt c<=acacccud ^ ^ ttttgtttta 3120 

Sgatttt gtggtaccag tactctctgc cactgaacat tttgaaatat ^^^^ g^go 
gatttgcaaa aaatgacata taggtcagta ctcacatgga tt^ttaag ^^^^^^ 3340 
tgtgataata ttttgaatct gagacgaata ^aacttttaa aaatcg ^ttco 3300 

cttttttttt tagagcagtt gtaggttaac ^g^aattg ^9^99 ^ caccacaatg 3360 
tttctcccct gacaaagccc tcaacagcct ^^H^l ^gtcatccl aagttcatgg 3420 
gtacatttgt tacaatcaat gaacctactc tg^^^^ca tgatgatatg 3480 

SScaSi SSiraS SSS g t |cct|al atcctctgc aaataaaact 3540 
attttaatga aacctaaaga agaaaatgta tagaaatacg c 



<210> 31 

<211> 1014 

<212> DNA 

<213> Homo sapiens 

<220> 

<?21> misc feature 

<223> ^cyte ID No: 1832403CB1 



<400> 31 ^f^^^^rr^-a aotttcaggc ttcaggacaa ttcgtgatgg 60 

cccacgcgtc cgccggcggc cctcgcggca gg^togggc cccacaggcc 120 

cgggggctgg ttccgccgct gtatcggggg ^^ggg^^^^^ cictgticag cagctcgact 180 
gSacctttt cgccgaaggg ^tgctggagt tcctgcgacc ^gctgcg^ g ^^^^^ag 240 
Itcacgtaca cgccgtcaga gagagccagg t^gagctccg gg^ gacccctatg 300 

ccacalaact gtgccgoata aatgaggatc ^^^99^90 l^^^^^^ Itacagaatg 360 
ttaaglagct acttaatgcc ^ggcgacgcg Jtgtcttggt ^^a^^ cgcaggagag 420 
ctcaggaacg actgagacgg faaaccaca Stg^m^ claataacag atgagcctat 480 
caatgctgga ttcgggaatt tacccccctg g°tccccagg ^ tgcaaagatc 540 

ggactcagta gcacaagtac tgttccccag °tgccttgcc . gtttgtttga 600 

ctaggagaca gtccocatag accttcagac atta^a^ ^^LcLaca gccagaatga 660 
agcacttcgt cttacccatt tatgtagggg ^^=^^9g^^ tttcacaaat gagctgaggc 720 
ggttcccaaa ggacttacat taattatggc tcttgcttcc tagttaacca 780 

ctctactttt ttttttaagc tgcatacttg ^ggcttacct tctc gg^ atgtcctcta 840 
gaggggcttc ctttgtatgt tacatgcctg 9tt^^^g9g rg^^^gcagg caaagtcaaa 900 
Ic?g?gactt ctcattttcc tgtttacact 9gg9atttgg agg|99C^99 ^^g^lg^gtt 960 

SStca Sfattfal SS^SS ^LgatlL aaaaaaaaaa aagg 



<210> 32 

<211> 4195 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc„feature 

<223> incyte ID No: 19717 47CB1 



^ZUU gc»=ctcc. ^ccagtoc^ |»»tg|gc, g^tcco. |J,,tg|cac 60^ 
ccacctgot gg=ootgg=g oggcgggtg. ^ccctgg^ ag^W|«| gcgtjgt" 300 

ssss? spsg li^gg ~? iro 

atggctcaat aagattgtgg cccaggtctg S^^ttcctg 99^=1^^^ al^atttac 600 
tctggctgaa actgtggctc cggctgttag ggs^tctaac f^^^^^^g^ aggttcaccc 660 
atttacacga gtggaactgg gtgaaaagcc attgcgcatc attggagtca ^^o 
aggtcagaga aaagagcaga tcctgctgga ^tt^=J^ g?caagSca tgcagftlca 780 
gattgatgtg gaagtgaaga aatatttttg caaagcagga gj-^- yaa ^= ^tgt 840 
tggcgttttg cgggtgatac tggagccact cattggggac gtcccttcg l^^^^J^^ 
gtcaatgtfcc ttcatccgac gcccgacfcct ^gacatoaac ^1 f ^ ccattgctgc 960 
gctggatatc ccaggactta gctcactctc tg^caccatg ^tcacgga io20 
cttcctcgtg ttgcccaacc gattactggt sccccttgtg "tgaccttc a^^^^g g ^^^^ 
tcagttgcgt tcccctctgc c^^ggggcat t^ttcgaatt ^^^gg ^^^^^^^^^^ ^^^^ 

gctgagttcc aaggacaaat atgtgaaggg ^^^gatcgag 99 aactcaaccc 1200 

acttgtgcgt ttgggtaccc agacattctg °agtcgtgtc ^ttga g 5 ^ 1260 

acagtgggga gagacttatg aggtgatggt acacgaggtc meagre as ^^^ggatgt 1320 
ggaggtgttc gacaaggatc cagataaaga tg^^tttctg 9gcagaacga 9 ^^^^ 
agggaaggtg ttacaggcta 9°gttctgga tgattggttc c^t^^acaag 9 J9g 9g ^^^^ 
ccaagttcac ttgaggctag aatggctgtc acttttgwa 9atgc a a « 3^500 

ggttctacag tggaattggg gagtctcctc t^gaccagt ||^^^9tcag ^ 9 ^^gg 
Igttgtctac ctggatcggg °Jcaggatot tcctctgaag ^l^^l tctacagtac 1620 
ccctatggta caactgtcaa ttcaggatgt 9actcaggag ag^^as « aaagccagga 1680 
caactgccca gtgtgggagg ^agcgttccg 9ttcttccca ^^9 tgacgctgcc 1740 

gctcgatgtg caagtgaagg atgattccag gf^^^tgact "aggagc « « ^gOO 
tctggcccgc ctgctgactg ccccagaact |atcctggac |agtggttcc ag^ g^^a ^gg^ 
ctctggtcca aactccagac tctatatgaa ^ctagtcatg ^99 ^ tggacagtga 1920 
atcagaaata tgcttcccca cggtgcctgg ttgtcctggt g=^g9ga 9 ^ga 1980 

gaatccccag agaggcagca gtgtggatgc ^ccacctcga ccctgtcaca 9 3040 
tagccagttt gggactgagc atgtgcttcg 9at=catgta ^^|9aggc ^. 3100 

tgccaaagac cgtttcttgg ggggactggt gaagggcaag tcagaccccc ^ 9 ^leo 
alagttggca ggacgaagct tccggagcca gt^Sttcgg 9|^9|tctca ^ g 9 ^220 
gaatgaggtt tttgaggtga tcgtcacatc ^g^tccaggc °a^9^ « gtctcaccac 2280 

ir.si ssss" ESI fefirt bsss 
ixi gsig s » i^i 

ttcagcccct gtctgggatg apgtgcctc ctttctcacc ^gg^ ccctgcccct 2700 

cctagagttg caggttcggg gtgagggcac tggcgtgctg gg^^cactat ^^9 2760 
ctcagagctc ctcgtggctg accagctctg cttggaccgc tggj«acac 9 9 |g ^^^^ 
tcaggggcag gtgctactga gagcacagct agggatcctg gtg^ccagc ^^^fj^^ ^gg^ 
ggaagctcat agccacagct acagccaoag ^tcctcatcg « 9 9 « « ^taacaca 2940 
ctcgggggga ccccctcaca tcacctcctc ^gcccca^g ^ tgactctgtg 3000 

tgttgacagt ccccttgagg ^t^f^gccgg g^^tctgggc ^jggcgaa a gggg 
gtactacagt gaagaacgaa agctggtcag ^attgttcat gg^^9^^99 ^a^cg 3120 

gaatggacgt gatcctcctg atccctatgt Qtcactgttg ftactgccag ^ 9 « ^^^^ 
aoacaccaaa aggaggacct cacagaagaa gaggaccctg agtcccgaat "f^J^^ ^ 3340 

»i i= ssggfg sssi giiil SSi h 
ssssc isiSfcS g|g « 3530 

tctcgctcca tcaccgcctc aatgtgatga gc^taaagct agggccaag 9g^| 3540. 

ISSS SISSI S= ct g^cggggg s-J 

SSr^^S SSSS rtS^a^^g ctt^tgccaa 37.0 
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atacaacttt ggaaggatct ttttttcttt aactagatgg tcaccttctt ccctaccaca 3780 

SSSSl? S'tSgS >cKS ISftSfa SSgSS 0 

SSctgccI agccactgga tcttacatta aacatcatac tcaaaaaaaa aaaaa 4195 

<210> 33 

<211> 3343 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> ndsc_fGatiire 

<223> Incyte ID No: 2285348CB1 

lis sra rs s lii | 

slsii ^ giiii iSi iS E iS i 

iiiisgiiiliisisisi 

i« tg il Is =s iSi ri 

S^oi ^SS??=? l"taS SSS«g« tgtcta.,,t M^actgca 10|0 
S sis sis? S« SSS^cSS i» S 

ctaaalltat gctctacgat tatgaacctc ctgagcctgg ccaatgagga ctctgtccct 1200 

™g ^ g« IS Si 

hf-ri-attaat aaatacagtt cacagcagca gtagaattca ttaaaaccai: cgatg^t^^y^ x 

sssiiiiiiissi 

bSss^ SSI SS?ggS SSSS ?5S5 s -ssss 

Sii S= 

aaaatttgat ggacagggct gcctttaatg agtgtgaagg tcactaagtc acttagacat 2160 
Sccg?S algttSfga 'cctgcatta ^ga^tagac tgattaccat -atgacata 2220 

ttcl^tga aScacagIc caagaattca gtgaatgtca ttttttaaaa aactaatttg 2400 
tatScgc ?c?agtglta caa^ttttac tagtgat^ 't^l^^ IlltcSctg 2520 

atfccttataa aaaaaaatat ctattttggc aggtttctgt gcctttatct ^^lon 

i is sss-^ sgfi 

&s sssir. ssas ^sss't ssss =31 

SftSiS llSSfS ISo 
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attttatttt attttatttt tttgagacag agtctcaccc tgttgcc-g ^ctggagtgc 2940 
aatggtgcga tcttggctca ctgcaacctc ^^^ctcccag "caagcga ^^^^^^^^^ ^^^^ 
cagcctcgcg agtagctagg actacaggcg ^atgtcacca ^gcccgg 3^20 

ss^s s« sssss? i issi ?«s i^i 
t^^t r^it^ «^ ? g ^ i?2S 

ttaaactagt actatgaaat gtccttttga atgttaggtc aag 

<210> 34 

<211> 2456 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc„feature 

<223> Incyte ID No: 2374186CB1 

ccagcacgtg gatcagcgtc gccagggtct ggaagattcc ^l^^^^H tgaattcaga 480 

mmmmmmi 

riiilil^!^.'^i. 

ssgs? ^ssf. s» sifH ™- issi 
SrS S SiS S lis 
Si Si IS S ipiS E 
sli^ sM IsS iilS 

iiipiisiiiisi 
siiiiiHSiiiigi 

t-amtcctca ttattttaat acaggaaaag aaaagattta gtaattcccc ^^^^^^^J:^^ ,^q« 

Ife ss^ss ps» ^ iii sgg 

?2« 'tSSlS SSftlSI f^^aS SSi'^ 



<210> 35 
<211> 2004 
<212> 

<213> Homo sapiens 
<220> 
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<221> misc_feat\ire 

<223> incyte ID No: 2476232CB1 



PCT/USOO/34919 



== fS s#3 3S SS S 

« =i ps is SS « 5 

ES piss ss SH 2S S 

ocooggatgc cMjggeoao tct^gaj ^^^^ tSSS tcttcatS. 1140 
SSSS SS"^ S ™ c=gg tc« 13.0 

s=g? IS i 11 i SSib gsss^^ itfo 

mmmmmmi 

ssss srtsss Sissi ggg 
isr^ ssst s=s?s isssss g?.t«»tt „so 



atgaaaaata gtggcaggct cgta 

<210> 36 

<211> 4019 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 2503986CB1 



iissfe ssss « SI 1 IS i S S 0 

SSSc? SSSS I^Sc ? t« .40 

S2S?S »S I g™ 

atgtctctgg .gttoCg.tt g^jagtggg gSSacSa cSSS== 480 

» liii SI pt^' #Ji !S! Hi 

^cacagcc gccagaagat atcagctgca ttgcatggaa ^^gacaagtt cagcatattt 560 

tagcatcagc cagtcccagt ggccgggcca ctgtatggga tcttaggaaa 

tcatcaaagt cagtgaccat agtaacagaa tgcattgttc tgggttggca ^ggcatcctg 'o" 

iss ^ =1 iili Sii iilS i 

tggcaattgc ttggagcatg gcagatcctg ^attgttact g^||ngrgga « « ^(,30 
agattctctg ctccaatcca aacacaggag aggtgttata ^gaacttccc ^^Y^^^^ ^qqq 
agtggtgctt cgatattcag tggtgtcccc gaaatcctgc tgtcttatca gctgctccgc lua 



36/44 



wo 01/46256 



PCT/DSOO/34919 



tt,.«„o, -toagatt tatto«t^ t^gag^ -»ggt tt^cga l|.o 
aoaagocacc =>«gtggatl <=3»M3"g ctStSmg agoigSS SgSgoagc 1380 

ssas^ stM sr^|2 S I'-lb itss 
^i^. ssfsi lili fe= ^S3ss iiro 

attctcgtgg aaaatacctt ga^cttctag S^tacagaaa |5 9 tctgaccaag 1680 

ttgctttggc cttgaacaaa gtggatggag °caatgtggc ^"caaaga « ^^^q 
tagcacagag tgatggggag gagagccctg ctgctgaaga gagctcttg g^^^^^^^ ^goo 
ttaaagagga aaaagaagaa tctgaatttc taccctcatc aattttgaga 1860 

«tS l^ini^ S ||ccga ?gccattata ttggccata^^ 1..0 

tgcagactca agcatgtctc tgctatattt S^gcagggaa ^9 « ^ . attgagaaag 2280 
^tgiactaa agctcaagat ggaagccacc cttt^tcact t^aggatctg atcgag 9 ^^^^ 
ttgtcatcct gcgaaaagct gtgcaactca ctcaagccat Macactagt acrgc gg g ^^^^ 
ttctcttggc tgcgaagatg agtcagtatg ccaatttgtt gg^agctcag f I 2460 

i^smi gS£S| lig '^fi nil 

SSSS^ Scagcj -If- f,- 

t^ttoggaac aggggggtoa goaatgtato gaootoagoa .^gtcto 2830 

oaaacBctta ccotaacaoc S^SSotS "S«tgrt littclttco 2940 

tl^SS SlcSrtS a^c gogg ac gggct cca^tcat 3000 

MS SSlSgE =1« i g !i Ms =i lifo 

tacccaaaaa gaagaagatg cctgaaaact tcatgcctcc tg^tcccatc ac ^^^^^ 
tcatgaaccc gttgggtgac ccccagtcac aaatgctgca g^aacagcct tc S 9 ^^^^ 
taccactgtc aagccagtct tcattcccac agccacatct t^^aggtggc «g ^^^^ 

ss^^^ ?f~ S Sss lll^KlS nil 

atgatgccag caaacgtttg gagtttctgt atgataaact taggg 9 ^^^t^etcag 3660 
gStltS?' ?gc=atttK cSaagaaac -tgtt^ "JSltSc 

ESI SI S3 iHii ibs'/ 



3720 
3780 
3840 
3900 
3960 
4019 



<210> 37 

<211> 1793 

<212> DNZV 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 2596566CB1 



<400> 37 

cgacgccgga cgtgcggcag 
gttctccacg ctggttcgca 
gacagagtac tattttgaag 
acctgatcta ttatgatgac 
tggactgcat caacatccgc 
agtcaaaaga ctgcatgctc 
cagaaagcac agatgattgc 



ttgcaggcga gcaggcgagg 
ggtgaagaga tggcgtttgt 
cgctggaaga agaactggtt 
cagactcggc agaatatcga 
acggggcagg aatgtcggga 
cagattgttt gtcgagatgg 
ttggcctgga aatttacact 



aatcgccgtg gcgtcttggt 60 
gaagagtggc tggttgctgc 120 
tgatctgtgg tcggatggtc 180 
ggataaggtc cacatgccaa 240 
tactcagccc ccggatggaa 300 
gaaaacaatt agtctttgtg 360 
ccaagattct aggacaaaca 420 
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ss ss ss sss ss i 

-iSt Sfi =S is SS s 

sssss 1= gsg rSs E-k: ssssss b:: 

octgaggcag gagaatagtg tgaacccagg aggcggagct tgcagtgagc caagatagtg i/4u 
cScScact tclgcctggg tgacagagtg agactctgtc tcaaaaaaaa aaa 1793 

<210> 38 

<211> 4427 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 2685253CB1 

S SS sSS «= Sp Si I 

mmmmmmi 

mmmmmMi 

mmmw4Mm=. 

cttoctctaa atattgaaag gaatattcaa aagggtgatt atgatgtg^ ^S^^^ttat 1080 

~ lira s Si sas ini b 
is IS Si si 

llgi Si! iSif IS ii!El Ps i 

tcaaSSSa ttglLgatc aaagaatgta aggcaaagac aaaatgattt taagaaaatg 1620 

i s ^isb ssss »s isst^i s s 
ssls s« ssss "^i! S 

S« MS SSScI «S iSS^iS II.. 
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gtgtgttctc tgcagtcact gaagggggtt ctggagtgca agccgggaga ggccagtgtc 2040 
ftccLcaac ctaaiacaca ggaggaggtt tgccagctaa gcatcaatat fatgcaggtt 2100 
tttatatact gtctggaaca gttgagcacc aagcctgatg cagatataga tactacacat 2160 
ctctctgttg Itgtttcttc ccqtgacttg tttggaagta tccatgaaga cttcagcttg 2220 
IcctcaSac agcgcctttt gatagtccta agtaattgct gctatctaga acgtcacacc 2280 
ttcctaSta tcgoagaaca ttttgaaaag cacaacttcc agggaafcaga aaaaatcaca 2340 
caggSagca tggcctcatt gaaagaacta gatcaaagac tctttgaaaa ttacatcgag 2400 
ttilaaglag atcccatcgt tggctcctta gaacctggaa tttatgcagg atattttgat 2460 
tgSaggact gcctgcctcc aacaggtgtc agaaactatt taaaagaagc actggtgaat 2520 
ataattaccq tgcatgcaga ggtgttcacc atttccaaag aactggtccc tcgggtacta 2580 
tccaagtga SgScaSt ttctgaagag ctcagtcgac tgatgcagtg tgtttcatcc 2640 
ttcagcaaaa atggagcttt acaggcgaga cttgaaatct gtgctttgag ggacactgtg 2700 
gctgt^cc tglcacccga aagcaagtca agttttaagc aggctttgga agccctgccc 2760 
cagctttcca gtggagcaga taaaaagtta ctggaagagc tcctgaacaa gttcaagagt 2820 
Igcatgcact tgclgltcac ctgtttccaa gcagcttctt caaccatgat gaaaacataa 2880 
atatctgcca cataaaagaa gtccaggaaa ataacacgta ataagactgt tcactctcta 2940 
agtaccctaa aggtatttgg tgtattaaac attgggtttg ccatttttct °ttttttctt 3000 
cctctgactt cgaaaattgt tggtatacat ttcaaccaaa atgacctcat tt^aaatgcc 3060 
caggaagtat ttgttttgcc attcttacta gatcagatcc tgtatgactt taaaatacat 3120 
??taaaltat attttgcatc tcagcagaag gttgagtgct gcaggaagag ctgcttctgc 3180 
agggagtttt ctggaatggc gtggcacata ttgaagcatt tctcagtgtc ccgttcacgg 3240 
aagcgcaggc agctcccacg acacacggag agactgactg atacgagatt tggaaagcta 3300 
tgtagacltc tttggagctc ttactgtcct aaactgtaca gctgtgctta aaacccttat 3360 
ttcatataaa tggccttaag ttttctaatt caagcgggtt tttggaaaaa tttatggtct 3420 
ccI?te^St acatattaci actggggtag attatttgtg gtccaggtgt ctgt^ttta 3480 
actttgcgtt ttgctatctg atttttattt ttcacagggc taagcatgag ctttcattct 3540 
cactclctct taltttgtcg tgcgtcacta cacatgcacg tgttgcagtc cctgaggccc 3600 
tgtgtgttat ctgtgatgga gtgtgaatgt gtaacgggca ctgtgttaca ^tctcaggtg 3560 
ttgidgggc ggtcgcagac ttcagggtcc cctaacggaa aggccaggct ccgcgtggac 3720 
ggccalctcc Itgcccgctc cttcagcagg tgactgtctc tgccacttct tacctgctga 3780 
Sgatcttgc tcagtagctg gaacaatgct gctgtcacac agtctcttct ctgaaacttc 3840 
algatgctcc ttggtcacca ggcaatggga gctgtagacc agccgcatgc acttgcccca 3900 
cittclctgc tgactggctt cactggaata ggttcaggtc accgggactt ctctcagagt 3960 
cagcagccla cccaactccc tcataccgtc gcatctgaaa attttcagag aggaattctc 4020 
tttgtlgtcg gcttgtcagg gttttcaagt ttttctatgg ttatttttaa atctcttttt 4080 
ttaaatccta aaattaatgt cctcattaat gcaagtaatt ttagagcaca gtgtagccat 4140 
gtacStttc c^tatttaa aataaataaa tgtgttggtt catttggtgt atggggggac 4200 
Itcatlgtaa ctggaaatga aagtttaatc tttcacaaat gtgcttgata aagctgttaa 4260 
galttgtctt tcctaagtat gttaactgtt gtatctaact caactgtgag ctaagtttaa 4320 
aaacticaat catgaatgtt tgtgatattt taaatatcta ctacagcaca ttcagccaat 4380 
aaagaatctg cctttaatgt aagttgtaat ttacttgaaa aaaaaaa 



<210> 39 

<211> 1583 

<21-2s. ntlA 

<213> Homo sapiens 



<220> 

<221> inisc_feature 

<223> Incyte ID No: 2762252CB1 



<400> 39 

gagctgaaag ccaggaacat ccgaggagaa 
ctggaaatcc agacaccccc acccccaccc 
gtttgcccga cagaccatgg ctccctttgg 
cagatctcca actaaagaca tggattcaga 
agaagagaag cttgtctgtg ggctgactaa 
tttgcttgag gaacatgagg ctacgtttgg 
tgattactgc atcatagaga agtggagagg 
aatcctgaag ctttggaaag cgtggggaga 
taaagcagat gcttttcttc cagttccttt 
aaacacagaa aaattgtggg agctcagccc 
taaacaaaaa agaatagtca ggaaaacttt 
agtttctcat gatcgagata atatggagac 
tactattcat cagcaagtca agagaatgaa 
agctaagttc catcttgatc gagtagaaaa 
tttaatgccc agtttcagtg aagttgagca 



gagaaagctt ccagccctcc tcccttcacc 60 
tcagatcact ttaagataat ttctttattc 120 
aagaaacttg ctaaagactc ggcataaaaa 180 
agagaaggaa attgtggttt gggtttgcca 240 
acgcaccacc tctgctgatg tcatccaggc 300 
agagaaacga tttcttctgg ggziagcccag 360 
ctccgagagg gttcttcctc cactaactag 420 
tgagcagccc aatatgcaat ttgttttggt 480 
gtggcggaca gctgaagcca aattagtgca 540 
agcaaactac atgaagactt taccaccaga 600 
ccggaaactg gctaaaatta agcaggacac 660 
attagttcat ctgatcattt cccaggacca 720 
agagctggat ctggaaattg aaaagtgtga 780 
tgatggagaa aactatgttc aggatgcata 840 
aaatctagac ttgcagtatg aggaaaacca 900 
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gactctggag gacctgagcg aaagtgatgg 
ttaccgaata ctcattgata agctctctgc 
cattgatata aatgaagatg cggaagggga 
agagagtgtt aagtgtgatt tggagaaaag 
tttgagtggc atccagaaag agattaaata 
agaatatgaa ctcctggcca aggaattcaa 
ccagttaaag gaaaacagag cgaaggaatc 
tccctttact caaagagtat ttagcaatta 
cagttctaac cacagtcagg actccgaaac 
atagttccaa tggctccttt ctgacctgct 
ggaaacctca ttttaaatat aacactcaaa 
agttaataaa aactcgagaa atg 

<210> 40 

<211> 1416 

<212> DNA 

<213> Homo sapiens 



aattgaacag ctggaagaac gactgaaata 960 
tgaaatagaa aaagaggtaa aaagtgtttg 1020 
agctgcaagt gaactggaaa gctctaattt 1080 
catgaaagct ggtttgaaaa ttcactctca 1140 
cagtgactca ttgcttcaga tgaaagcaaa 1200 
ttcacttcac attagcaaca aagatgggtg 1260 
tgaggttccc agtagcaatg gggagattcc 1320 
cacaaatgac acagactcgg acactggtat 1380 
aacagtagga gatgtggtgc tgttgtcaac 1440 
ttcatgtttt aatgtttgtt taatttaata 1500 
aaaaatgtaa atcatattgt agtattcaat 1560 



<220> 

<221> misc_featiire 

<223> Incyte ID No: 3452009CB1 



gfcccggcggg ttacagcgga ggcctaggtg gcagacaggg ggcccgggcc gctgcgtgtt 60 
gtcclcclla gatggagttc ctcctgggga acccgttcag cacacc^gtg gggcagtgcc 120 
tcgaaaaggc aacagatggc tccctgcaaa gtgaggattg gacgttgaat atggagatct 180 
gtgacatcat caatgagacg gaggaagggc caaaggatgc cattcgagcc ^tgaagaagc 240 
ggctcaacgg gaaccggaac tacagagagg tgatgctggc attaacagtg ctggagacat 300 
gtgtgaagaa ctgtggccac cgcttccaca tccttgtggc caaccgagat ttcatcgaca 360 
gtgttctggt caaaattata tctcccaaga acaaccctcc caccattgta caggacaaag 420 
tgcttgctct gatccaggca tgggctgatg cctttcgaag cagtcctgat ctcaccggcg 480 
ttgtgcacat atatgaggag ctgaagagga aaggggttga atttcchatg gcagacttgg 540 
acgctctgtc tcccatacac acaccacagc ggagtgtccc tgaagtggat ccagctgcga 600 
ccatgcccag gtcccaatca cagcagagga caagtgctgg ttcctattcc tcgccgcctc 660 
ctgctcccta ctccgcaccg caggccccag ctctgagtgt gactggdccc atcacagcca 720 
attcagaaca gattgccagg ctgcggagtg aactggacgt cgttcgagga aacacaaaag 780 
tcatgtctga gatgttaaca gaaatggtcc ctggacagga ggattcatct gatctggagt 840 
tgctgcagga gctcaacagg acctgtcggg ccatgcagca gcgcatcgtg gagctcatct 900 
cccgcgtgtc caatgaggag gtcaccgagg agctgctgca tgtgaacgat gacctcaaca 960 
acgtcttcct tcgatacgag aggtgggagc cggatttctt tttttttttt tttcccttga 1020 
aaagattgtt gccataacag agggatttct ttcataaaca aaggaaagag agggagaggg 1080 
agagagatta gagttggaaa atactcatca gagatgctaa tttgttcccg gctgacagga 1140 
aaaatgcttc ttgtcctcct gggaactgag cagcagaagc agcaggaagt tgacgtgcag 1200 
aattaggctc tgccctttgt aagcctcagg ctgctctgcc tttccccact gccccgccac 1260 
ccccagcccc agggaggggt agcacttggg ggcctctggg ccccagagct tctgcacctg 1320 
ctcctttgat gatcctttga atttgaaaag gattaaaggg aaaaaaaatc tctagtaaga 1380 
aaaaggattt cctctttccc atctgtgaaa aaaaaa f i^-*-^ 



<210> 41 

<211> 1662 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> iaisc_feat\irG 

<223> Incyte ID No: 4644780CB1 

<400> 41 

cgatgccggc ggtcagtggt ccaggtccct 
cccacagccc tgagacgggg tgtcctcctc 
aaggcccaag gctggcattg cctggggctg 
ccatcctggg cctggaggaa gtgcggctga 
tgtggagcag ggcctctgtc cccttctctg 
cgggactggg gcgccgggga gcccagggca 
atgtaggctc tgtccttggg gggctggctt 
ctccggcaga ggatactcag gacagtcctg 
tcccctctga gcagcctggg gatggagcta 



tattctgcct tctcctcctg ctcctggacc 60 
tacgcaggtt tgagtacaag ctcagcttca 120 
gaataccctt ctggagccat catggagacg 180 
cgccatccat gaggaaccgg agtggcgccg 240 
cctgggaagt agaggtgcag atgagggtga 300 
tggccgtgtg gtacacccgg ggcaggggcc 360 
cgtgggacgg catcgggatc ttctttgact 420 
ccatccgtgt gctggccagc gacgggcaca 480 
gccaagggct gggctcctgt cattgggact 540 
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iiiiiiis^iii^i 

S2 SS sS™ ss= SS SIS is 
SS S SI Sp Sfe Is 

tctecctcct clftoalact gtaggcttct tcggctacgt ^cacttcagc -Sgaggagc 1380 
tgaacaagag ccttcaggag tgtctgtcca caggcagcct tcctctgggt =^tgcaccac 

^SSSS '^^c^m? t«SSS SS^oS S« » t ||o 

siss sssss r-'--^ "~ i?« 



<210> 42 

<211> 3387 

<212> HNA 

<213> Homo sapiens 

<220> 

<221> iuisc_feature 

<223> Incyte ID No: 4946103CB1 

<400> 42 



ccSStctcccr ttttggaaga cccgcctcgg cacagccagg ctcagtccgg ccttgcggta 60 

S ii «r= ssis 
™s & iSii Si S~ i 

aaaccccgaa ratrrcattta aaaattgaat cagcgatatc tccggaagaa 600 

2^ga?S^ gtcaag^ta gcScagc? ?ttggalgaa tc?gttccaa gtggagaaaa 660 

siiiissSisiii 
si IS s si Hi ^1 

S' =1 ws sss^ra ssagi K 

actaaccaat cacctcaaca atgtttttgt tcaacagggc cacgatcaga gttcgtctct 13^u 

Hiii sii iirs ss s 

«i SIE liSS ps lii i 

tga?ctSt gttgl^gaca ggaccaaatt tgataagatc tttgaacagg tactaagtga 1740 
a^hfToaaccc ctatgtctgg cagaacagga cttcataagt aaatttttca aaccacagca xouu 
acScS Itgcctggll ctltagctla agcagaggac ctggatggag ^aacattatc 1860 
Sc^at aattgtggca caccactgcc tgtttcatct gagaaagata tg^^^^^^^^ 
Ltgatgatt aaaatatttc gctgcattga gccagagctg aacaacctaa "gcattagg 1980 
aaacaaaatt cfatagcttta actctcttta tatgttagtc aaaatgagtc atcatgtgtg -iU4u 
Sctgclcaa Stgtggacc ctgcttcttt cctaagtact acattgggaa atgttttggt 2100 
SctgtcSt Sgaaltttg acaaatgcat tagtaaccaa ataaggcaaa tggaag^gt 2150 
Katctca allaagagtl aagttggaat tcttccattt gttgctgaat ttgaagaatt 2220 
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tactaaactt gcagaatcaa tcttcaaaaa tgctgagcgt cgtggagacc tggataaagc 2280 
ataclcc^ cttltcagag gagtatttgt taatgtggag aaagtagcaa atgaaagcca 2340 
aaaatcc^ agggatgtgg ttatgatgga aaactttcac catatttttg caactctttc 2400 
tcgStgaaa atltcaStc tagaigcaga aaaaaaagaa gccaaacaaa ^atacacaga 2460 
tcIcctCag tchtatgtca tttactcttt aggacaacct cttgaaaaac taaatcattt 2520 
llltallaat attgaagctc gcgtggcaca gggcataagg gaggaggaag taagttacca 2580 
IcttSa??? ^clScaag Llttcgtaa agtcattaag ^agtaccctg g^aaggaagt 2640 
aaaaaaaagt ctagataacc tctacaagaa agttgataaa catttatgtg aagaagagaa Z/ou 
^c??cag gtgjtgtggc actccatgca agatgaattt atacgccagt ataagcactt 2760 
tgSSgtttg ItSctclct gttatcctgg atctggtgtt acaatggaat tcactattca 2820 
cgaaggcttg * « ^ . ccaacattgc acagtcccac taaaccttgt gaaagaagaa 2880 

SBSs Hfisi i« igi ?r 

nai-actacaa atatttaaat gcaaaattac caacctatat agcagtttta "^^f^^^^^ 

SSScIta cSacttaag Lttcatgtc accataaaat gcc^tagca tttctcaatg 3120 

actagatggg aaattttcct ttattgccta gctgcttgtg tttgagtggt tgtcctatga 3i80 

acStacatt tggagttctt cagctttcac tacttctctg ttgcttgcta atcatgtaac 3240 

ta^alSta ctgtacaaaa ttgttttttc acactaacaa atgtgtatat ggagaagagg 3300 

gctcaS^ga SScatttg tgLcttaga tttttgagga ttatgtgact agtaataaat 3360 
gtgaaataaa ttttcaaaaa aaaaaaa 



<210> 43 

<211> 1043 

<212> DMA. 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5562355CB1 

<400> 43 



acSSqcSca tggacaacgc ggggaaggag cgtgaggcag tacagctgat ggcggaggcc 60 
aaSSag tcLggcctc ccactccttc ctccgagggc tgtttggagg aaacacaaga 120 
It^SgSg ctStgaaat gtataccaga gctgcaaata tgttcaagat ggctaaaaat 180 
SSSgctg caggalacgc Sttttgtcag gcagccaagc tccacatgca gcttcagagc 240 
S^citgact ctictacclg ctttgtggat gctggaaatg cttacaaaaa ggcagatocc 300 
^llalctl tclactgctt aaatgcagcc atcgacattt acacagacat gggaaggttt 360 
acScag ccSgcIcca ^tlctltt gcagagatct atgagactga acttgtagac 420. 
tttaagalg? ctattgcaca ttatgaacaa tctgctgatt attacaaagg agaagaatcc 480 
ScagctSg caaacaagtg totgctgaag gtggcagcat atgctgccca tcttgagcag 540 
taccSgaaci ccattgagat ctatgagcag gttggggcaa acacaatgga taatcctttg 600 
acgacltaca gtgcaaagga ttacttcttc aaagctgccc tctgccactt catagtagac 660 
Sttgaatg Iclagcttgc tcttgagcaa tatgaggaca tgtttccagc atttactgat 720 
t^Iaaaaaat gtaaattatt gaaaaaactc ctagaagctc atgaagaaca gaacagtgaa 780 
gcttlcaSg Lgcagtgaa Igaatttgac tcaatatctc scttggatca ggctgacc 840 
accatCTttac ttcgcatcaa aaagtccatc caaggggatg gagaaggaga tggagaccca sou 
^tSaato tttttgtctt tgtggcatgc agctaactcc tctttagttt tgtcttaggg 960 
Stgatc "Stjggat gcctatttia tggcttaatt ttgttgcata tgagccagac 1020 
ggcctgtgta ttgtttaagc teg 



<210> 44 
<211> 3207 
<212> 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5678824CB1 

<400> 44 



ccJgaggtgg gagccctggg ccaaaatggc ggcctacctg cagtggcggc gcttcgtttt 60 
cttcgaSag gagctggtga aggagccgct gagcaatgat ggggccgctc ccggggccac 120 
acctacttct ggatccgctg cttccaagtt cctttgcctc cctcctggca tcactgtctg 180 
Stcaggc cgaggglgcc tggtctttgg agatatggaa ggccagatct ggttcttgcc 240 
acattcccta cagcttalag gcttccaaic ctacaaacta cgggtgacac acctgtacca 300 
aSaagcag cacaatattc tggcatct|t tggagaagat gaagagggca tcaacccctt 360 
ggttaagatc tggaacctgg agaagagaga tggtggcaat ccactctgca ctcgaatctt 420 



42/44 



XVO 01/46256 PCT/USOO/34919 

ccctactatt ccaggaacag agccaactgt tgtatcttgt ttgactgtcc atgaaaatct 480 
c^ctttata acclttggtt tcacagatgg cagtgttaca ttgaacaaag gagacatcac 540 
cl^oaSgg latagcSga cccagatttt gcacaagggc aactatcctg taactggatt 600 
ggcctttcgc cEScaggla agaccactca cttgtttgtt gtgacaacag agaacgteca 660 
o?cctatata atttctggaa aagactaccc tcgcgtggag ttggacaccc atggttgtgg 720 
nntnna^ll tcaaccctaa gtgacccttc tcaggacctg cagttcattg tggccgggga 780 
tgStgtgtc ScttgSc agcctgatga acgtgggccc tgcttcgcct ttgagggcca 840 
imifn^t Mccactgat ttagaggcta ccttatcatt gtctcccgtg accggaaggt 900 
ttttcccl^g tcagagt?S ccagcSgga ttcacagagc tccgacaagc agattct^ 960 
catctatSc ctgtgcaaca agttcatagc ctatagcacc gtctttgagg atgtagtgga 1020 
tgtgcttgct gagt|gggct ccctgtacgt gctgacgcgg gatgggcggg t^cacgcact 1080 
gcSSgaag gacacacaga ccaaactgga gatgctgttt aagaagaacc tatttgagat 1140 
ggcgStSc cttgccaaga gccagcatct ggacagtgat gggctggccc agattttcat 1200 
aclatatgga gaccatctct acagcaaggg caaccacgat ggggctgtcc agcaatatat 1260 
ccSacSt gg^gttgg agccatccta tgtgatccgc aagtttctgg atgcccagcg 1320 
cat^caSac ctgactgcct acctgcagac cctgcaccga caatccctgg ccaatgcc^ 1380 
ccaSStc ctgctcltca actgctatac caagctcaag gacagctcga agctggagga 1440 
attcatcaag aaaaagagtg agagtgaagt ccactttgat gtggagacag ccatcaaggt 1500 
cctccgSg Ictggctact actcccatgc cctgtatctg gcggagaacc atgcacatca 1560 
lllltaa^o ctglSgatcc agctagaaga cattaagaat tatcaggaag cccttcgata 1620 
ciSgS cSStg a^caggca^a gagcaacatg aagcgotacg g-agatcct 1680 
catgcaccac ataccagagc agacaactca gttgctgaag ggactttgta ctgattatcg 1740 
acccagcctc gaaggccgca gcgataggga ggccccaggc tgcagggcca actctgagga 1800 
attcltcccc atctttglca ataacccgcg agagctgaaa gccttcctag agcacatgag 1860 
ta^Igtgcag ccagactcac cccaggggat ctacgacaca ctccttgagc tgcgactgca 1920 
aS^tglgcc oacgagaagg atccacaggt caaagagaag cttcacgcag aggccatttc 1980 
cSctSg agtggtcgcl tctgcgacgt ctttgacaag gccctggtcc tgtgccagat- 2040 
acacaacttc caggatggtg tcctttacct ttatgagcag gggaagctgt tccagcagat 2100 
cataSctac cIcKgcIgl acgagcagta ccggcaggtc atcagcgtgt gtgagcgcca 2160 
tggfgagcag gacccctcct tgtgggagca ggccctcagc tacttcgctc g^aaggagga 2220 
gSctgoaag iagtatgtgg cagctgtoot caagcatatc gagaacaaga acctcatgcc 2280 
aectcttcta atagtgcaga ccctggccca caactccaca gocacactct ccgtcatcag 2340 
gStacctg gtccStL tacallaaca gagccagcag attgcacagg atgagctgcg 2400 
gSgcggcgJ taccgagagg agaccacccg tatccgccag gagatccaag agctcaaggc 2460 
cStcitalg attttccaaa agaocaagtg cagcatctgt aacagtgcct tggagttgcc 2520 
ctcaatccac ttcctgtgtg gccactcctt ccaccaacac tgctttgaga gttactcgga 2580 
ig?Stgct gactgcccci cctgcctccc tgaaaaccgg aaggtcatgg atatgatccg 2640 
^IccEg^ cagalacgag atctocatga tcaattccag catcagctca ggtgctccaa 2700 
taacagcttt tctgtgatt| ctgactactt tggcagaggt gttttcaaca aattgactct 2760 
actalccaac cctcccacag ccagactgac ctccagcctg gaggctgggc tgcaacgcga 2820 
cctlc?cSS cIctccSga ggglcactta agcagcctgg aggaagatgt gggcaacagt 2880 
gaSgaccga gSaacagL Iclltgggac ctgggcgggc gttacacaga aggctggctg 2940 
acSaoccIg ggctccaltc tcatctaatg tcacagccct cagaactaaa gcggactttc 3000 
tttccctgcl ttcttattta gtcagcttgc catccctcct cttcactagc agtgtagatc 3060 
attccagltc agtgggggag ggcacctcag caacctctga gtgtggacaa tagctgcttt 3120 
cttctctatc SaSIIiacc Iggctgtgct tgggtccttg ctctcagagt ctataaataa 3180 
aagaatataa tgattaeiaaa aaaaaaa 
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<223> Incyte ID No: 58709 62CB1 

<400> 45 



cJgaactaac cctctaaccc ttgggagtct gtgtgcagca gtgatgtgag ^tgcatccgg 60 
gctgaaatgg gagctcccgc tatggcactg gctcaaggag ctaatctaaa gagaaagtat 120 
tttg^galg acatcttggt tgcacccctc attgcatctt gactaattgc ttcoaactco 180 
aaaaacttcc agggacaatc tcattgttct tttctctcca agggtcagcc atgttaacca 240 
fttgttaatt tittctgaac agatcLgca aaagtagata atcagccaga agaattagtg 300 
cgtagtgctg aagatgatga gaaaccagat cagaagccag ttacaaatga atgcgtacca 360 
aSStcca caltgcctac acaacctgat aatccatttt ctcaccctga caaactcaaa 420 
aS^tgagca agtctgttcc agcatttcto caagatgagg tgagtggcag t^tgat^gt 480 
gtttatagtg gagactttgg caatctggaa gttaaaggaa atattcagtt tgcaattgaa 540 
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tatgtggagt cactgaagga gttgcatgtt tttgtggccc agtgtaagga cttagcagca 600 
gcggatgtaa aaaaacagcg ttcagaccca tatgtaaagg cctatttgct accagacaaa 650 
ggcaaaatgg gcaagaagaa aacactcgta gtgaagaaaa ccttgaatcc tgtgtataac 720 
^tactgc ggtataaaat tgaaaaacaa atcttaaaga cacagaaatt gaacctgtcc 780 
atttggcatc gggatacatt taagcgcaat agtttcctag gggaggtgga acttgatttg 840 
gaaacatggg actgggataa caaacagaat aaacaattga gatggtaccc tctgaagcgg 900 
iagacagcac cagttgccct tgaagcagaa aacagaggtg aaatgaaact agctctccag 960 
tatgtcccag agccagtccc tggtaaaaag cttcctacaa ctggagaagt g^acatctgg 1020 
gtgaaggaat gccttgatct accactgcta aggggaagtc atctaaattc ttttgttaaa 1080 
tgtaccatcc ttccagatac aagtaggaaa agtcgccaga agacaagagc tgtagggaaa 1140 
accaccaacc ctatcttcaa ccacactatg gtgtatgatg ggttcaggcc tgaagatctg 1200 
atggaagcct gtgtagagct tactgtctgg gaccattaca aattaaccaa ccaatttttg 1260 
ggaggtcttc gtattggctt tggaacaggt aaaagttatg ggactgaagt ggactggatg 1320 
gSctctactt cagaggaagt tgctctctgg gagaagatgg taaactcccc caatacttgg 1380 
attgaagcaa cactgcctct cagaatgctt ttgattgcca agatttccaa atgagcccaa 1440 
attccactgg ctcctccact gaaaactact aaaccggtgg aatctgatct tgaaaatctg 1500 
agtaggtgga caaatatcct cactttctat ctattgcacc taaggaatac tacacagcat 1560 
gtaaaagtca atctgcatgt gcttctttga ttacaaggcc caagggattt aaatataaca 1620 
iaatgtgtaa tttgtgactc taatattaaa taagatattt gaacaagcta ggaaaattga 1680 
atttctgctg ctgcttcaaa gaaaaagctg ccccagagca ttaaacatgg ggtattgtta 1740 
agaagcaaaa tgttcttgtt tgccatcatg tgtttcacac cacaattctg tgccacagtt 1800 
aagagggtct ggtacccttg caggaccttt gtaggttgtg ggaaaaagtc gcagaaagat 1860 
actcaaagtg gagcagggaa tggagacaga catcagtgat gataaaaaaa aaaaatggac 1920 
cttaagaaac tatttactct gtaatctcta ataaaatatg gaattccata ttagggcaat 1980 
gagactgaaa ctactggtgt ttttctgcct tgagaaaaca aacagttaaa acaagcctca 2040 
Stgtatttt agtgccaccc actggcc^ta ggtacaattc agttgttggc ttgttttgac 2100 
ttalttctaa ^taggtctc aagcctgtat ttttatgagt ttattttttt ^aaaccctgc 2160 
atatatatga ttgtttttct tataacttta ctatatgaaa gcagcataag agtagtcaca 2220 
aacatgtttt gcaacaaagt tttaattaga atgtaagttg ctcagttata ctgttcttct 2280 
tatgtatgta aaattttcgt attttgtaaa aacccttaga ataaattatc atttgattta 2340 
aattgtatta gaaaat 
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<223> Incyta ID No: 2818605CB1 

gctttgctgt tgttgctctt cggaggcggc gatccccgaa ggcgagotga aatacggctg 60 
caggctacaa tttgcagccg accattatgg atgacaagga gccgaagagg tggcccaccc 120 
tcagggaccg cttgtgctcg gatggcttct tatttcccca ataccccatt aaaccgtatc 180 
atctgaaggg gatccacaga gctgtcttct atcgtgatct ggaggaactg aagttcgttc 240 
tgctcacgcg ttatgacatc aataagagag acaggaagga aaggaccgcc ctacatttgg 300 
cctgtgccac tggccaaccg gaaatggtac atctcctggt gtccagaaga tgtgagctta 360 
acctctgcga ccgtgaagac aggacacctc tgatcaaggc tgtacaactg aggcaggagg 420 
cttgtgcaac tottctgctg caaaatggcg ccgatccaaa tattacggat gtctttggaa 480 
ggactgctct gcactacgct gtgtataatg aagatacatc catgatagaa aaacttcttt 540 
cacatggtac aaatattgaa gaatgcagca aggtataggt caaccaatgt tattttcaaa 600 
ctatctgaaa tgcatttatt ttaacattga cacatgtaag ggtcaatttt tcatatttgg 660 
aagctcaaac attccttgaa tgaaaatatt ttgaaatgcc ttaactgtct aagattttac 720 
tttaaatatt ggaactttta aagaagcatt atagggaaca gccttttttc atgcacttat 780 
ggtaaataac tataaaaaca aatgaattac aataaattta taattcatga caactgaatt 840 
tgggaaaggt aatagttaag tgtttttcca ctaaattact ttttttctaa tcagtgtgaa 900 
gtgacacagg aaagtaaaat tgtcccttat aaataggctt tattttaaat gtcaaagaaa 960 
attaaagaat ttcacaataa atgtacatgt tgttgctgtt gacaagtgtt gtatgtgaag 1020 
gtgatttcat ctga ^^^^ 
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